I've been looking at the posts in this forum, but couldn't find anything.
My extended attributes are in UTF-8, the default used by . I want to search for Unicode characters and replace them with their corresponding Unicode number
For example
If my string looks like "Well-Done", where assume the hyphen is an "en-dash" (&endash

. I want to search for this character and replace it with it's number 8211 (–

.
This is just one of the character. I wanted to come up with a regex that can generally search for unicode characters and replace them with their corresponding numbers.
Will appreciate any help or direction.
Thanks
Amit