Before POST-ing a form with text fields, I'm able to convert curly quotes from word into normal quotation marks with the following JavaScript snippet:
s = s.replace( /\u201c/g, '"' );
s = s.replace( /\u201d/g, '"' );
But I've recently encountered double opening/closing quotes as shown in brackets in the Question Title, does anyone know the unicode numbers for these?
Before POST-ing a form with text fields, I'm able to convert curly quotes from word into normal quotation marks with the following JavaScript snippet:
s = s.replace( /\u201c/g, '"' );
s = s.replace( /\u201d/g, '"' );
But I've recently encountered double opening/closing quotes as shown in brackets in the Question Title, does anyone know the unicode numbers for these?
Share Improve this question asked Dec 6, 2010 at 17:14 Level1CoderLevel1Coder 4917 silver badges18 bronze badges 1- What are you using on the server side? – Pekka Commented Dec 6, 2010 at 17:17
4 Answers
Reset to default 7U+201C and U+201D are the Unicode characters “
and ”
! You should already be catching them.
If you want to also pick up the single-quote characters ‘
and ’
and convert them to '
, that would be U+2018 and U+2019.
However, this kind of replacement is a Unicode Smell. What are you trying to do here and why? ‚‘’„“”«»–—
etc are perfectly valid characters and if your app can't handle them it won't be able to handle other non-ASCII characters either, which would generally be considered a Bad Thing. If at all possible, it is better to fix whatever problem these characters are currently triggering, rather than sweep it under the rug with a replacement.
You could easily find this out for yourself, in JavaScript, by using charCodeAt
. You could even do it in the Firebug console:
>>> "”".charCodeAt(0).toString(16)
201d
To toString
call at the end even converts it to hexadecimal for you. Remember to pad it with zeros if it's shorted than 4 digits.
Your code looks correct for unicode:
for start quote : U+201C
for end quote : U+201D
Source: http://www.cl.cam.ac.uk/~mgk25/ucs/quotes.html
HTML Escaped entities:
“ ”
Converted with this tool: http://u-n-i.co/de/