Odd characters in wysiwyg - character

I am using Redactor.
When pasting in text whether its from Word or Textedit I am getting really odd characters after saving.
For example an apostrophe is being changed to ’ and any time I open that page and re save it adds even more characters like this ‚¢ÃƒÂÂ
The page uses utf-8 charset
Any help would be great. Thank you

Related

GWT internationalization accent characters not displaying correctly

I have a few different languages in a gwt project. For some reason the accent characters for locale_fr.properties works fine but the accent characters in locale_es.properties do not display correctly (show as "?" in a triangle). The host html page is encoding using utf-8 as well as the project .xml file, so it doesn't seems like the issue is with what type of encoding I'm using. Can't seem to figure out why the accent characters would work for one language but not another. No difference in how the .properties files are set up from what I can tell...any suggestions on what could be causing this issue?

Sublime 3 - show bad hidden characters

So I recently switched from Sublime 2 to 3 and I can't get it to show some 'bad' characters (like the encoding/pasting) bugs. Why I need it:
Sometimes somebody sends me a piece of code (snippet or google analytics code or whatever) via email. I know you are not supposed to send it via email because the mail may add some not wanted characters into code but people still do it. So person sends me some URL via email which ends like this: /⁠org - you notice that 'bug' between '/' and 'org'? I am sure some of you know how its called.
But when I take this "/⁠org" and paste it into the .php file or .html file or whatever, then Sublime Text 3 will no longer display it! It will just show "/org" like that bug is not even there - but it is there and that url/code/whatever is of course wrong and its invisible/impossible to find it!
So how can i force Sublime 3 to show these bugs? That's all I need really, just to know when I paste accidentally some bad/corrupted piece of code/string into my script, that there actually IS something bad/corrupted. How can I make Sublime to display everything the same way pastebin does?
Gremlins plugin for Sublime Text 3 does exactly that (it helps to “identify invisible and ambiguous Unicode whitespace characters”). Zero-width characters are marked with an icon in the gutter.
This appears to meet your needs (if not every one of your desires).
Unicode character highlighter for Sublime Text 2 & 3
This plug-in highlights characters such as non breakable space, characters that often break compilers and scripts and is almost impossible to spot in the editor.

Solr.NET and Chinese characters

I have a search index that handles both English and Chinese content. All documents are imported to SOLR by Solr.NET
When I search the Chinese content from the browser (localhost:8389/solr/...) everything works fine, but when I execute the search using Solr.NET then I get 0 hit :(
I tried to set up a logger to see what's the difference between browser search and Solr.NET search but I get question marks (??????) instead of Chinese characters.
Any help would be greatly appreciated!
Open solr0.log with your browser instead of Notepad++ in order to make sure that this is a real problem and to find out which encoding the file was saved as - if it's not Unicode, chances are that Solr.NET can't search it.
Notepad++ has been known to have trouble with Chinese, either because of the encoding (Notepad++ doesn't support typical Chinese encodings like gb or big5) or because of the default font, which may or may not support Chinese characters. Browsers are more forgiving and will do anything in order to display the file correctly. In fact, to convert non-unicode Asian encodings to unicode, I often open a file in my browser, which will detect the correct encoding and display the raw contents, and then CTRL+A CTRL+C CTRL+V everything into Editor, save as UTF-8, done.

How can I properly display Vietnamese characters in ColdFusion?

I having a hard trying to properly display Vietnamese text in ColdFusion. I've proper charset set to UTF-8 but still no luck. The same texts work fine in a HTML page. What else am I missing? Any suggestion would be much appreciated.
Html:
ColdFusion:
Thanks!
There are two things you need to watch out for, as far as I recall of the top of my head.
The first is to ensure that the .cfm file itself is saved as UTF-8 - this is a file system option, and will probably be settable in your editor. This ensures that the UTF-8 characters are correctly preserved when saving the file.
The other is that every .cfm file that includes any UTF-8 text should start with:
<cfprocessingdirective pageencoding="utf-8" />
This ensures that ColdFusion delivers the page to the browser in the correct format.
Just to be sure, when you display your working HTML, can you check the page encoding used by your browser (ie. in FireFox you can right-click+page Info). Maybe your text is not UTF-8 encoded that could explain the problem...

UTF8 encoding problem, same results work fine in wordpress

I have a wordpress installation that clients can edit, all characters display ok. On the main homepage I query the same database for the same title and post content, but it doesn't display correctly - just a question mark
I have tried sending the utf8 headers manually, through htaccess and through meta tags. I have used SET name UTF8 (which turns the characters into the diamond symbol with a questionmark inside).
I genuinely cant figure out what it could be now and I really need these characters to display correctly.
Heres the homepage, you can see in the Sounddhism 6 preview that there are lots of question marks, if you click on it you will see what they are meant to look like
http://nottingham.subverb.net
I have passed it through the validator and it gives me this error:
Sorry, I am unable to validate this document because on line 373 it contained one or more bytes that I cannot interpret as utf-8 (in other words, the bytes found are not valid values in the specified Character Encoding). Please check both the content of the file and the character encoding indication.
The error was: utf8 "\xA0" does not map to Unicode
Which, i appreciate is supposed to help me, but I don't know what to do about it. Especially since that line, the letter generating the error is supposed to be a space and is AFTER the offending question marks.
Can anyone help?
Compare the encoding of both the back-end scripts in Wordpress and also your homepage script. If you're using IE, right-click the page and check the encoding. Sometimes it's set to "Auto-detect" and IE will often detect a different encoding for different pages, causing strange issues like this.
If you're not using IE, try using a tool like Fiddler to see exactly what encoding (and what bytes are being sent back and forth both in the back-end and your homepage script.
If forcing UTF-8 on your homepage script doesn't work, I would guess that the back-end is not using UTF-8.