I'm using Heirloom mailx 12.5-2.
When I receive a mail containing characters like "ä", "ö", or "ü", mailx can't display them and just shows "�".
The mail's charset is "iso-8859-1".
I already tried setting the options "ttycharset" and "sendcharsets" to "iso-8859-1" or "UTF-8", but it doesn't help.
The options "print-all-chars", "encoding", and "reply-in-same-charset" don't solve the problem either.
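For reference, here is roughly what my ~/.mailrc contains at the moment (one of the combinations I tried; ttycharset and sendcharsets are the documented mailx option names):

set ttycharset=utf-8
set sendcharsets=utf-8,iso-8859-1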
My locale and terminal are set to UTF-8.
Does anyone have another idea?
Thanks
I have a mail command that looks something like this:
sh ~/filter.sh ~/build/site_build.err | mail -s "Site updated." me@site.com
The bash script has a few commands, mostly grep, to filter the stderr. It sends the filtered content to me when our build process has finished.
It used to send the file contents in the message body; however, it now sends them as an attachment, which I do not want.
When I delete most of the text, it arrives in the message body once again, so I've reasoned that this has to do with the message size -- is this correct?
Anyway, how can I prevent Unix from sending the message contents as an attachment, no matter the circumstances?
Thanks!
I'm still unsure, as many things could cause mail to apparently send its input as an attachment. But in many Linux distributions, mail is an alias for Heirloom mailx, which is known to encode its input if it contains any non-printable characters.
As soon as it finds non-standard characters, it adds the following headers to the mail:
Content-Type: application/octet-stream
Content-Transfer-Encoding: base64
and the text of the mail is effectively base64-encoded. It is not a true attachment, but many mail readers treat such mails as an empty body with an unnamed attached file.
If your LANG environment variable declares a locale that can use non-7-bit characters (éèûôüö ...), mail seems to be clever enough to declare an extended charset (ISO-8859-1 for the fr locale) and to use quoted-printable encoding. So even with (at least West European) non-7-bit ASCII characters in the message, provided there are no control characters, the mail should be sent normally.
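So one simple thing to try is making sure mail itself runs under such a locale. A sketch, assuming en_US.UTF-8 is installed on the build machine:

sh ~/filter.sh ~/build/site_build.err | LANG=en_US.UTF-8 mail -s "Site updated." me@site.com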
The alternative would be to filter all non-printable characters from the output of filter.sh, and the last-resort solution would be to skip mail and use sendmail directly.
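For the filtering approach, tr can drop everything that isn't printable. A sketch that keeps printable characters, tabs, and newlines (adjust the kept classes to taste):

sh ~/filter.sh ~/build/site_build.err | tr -cd '[:print:]\n\t' | mail -s "Site updated." me@site.com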
(Ref: an answer of mine to another post, but that one concerned Java.)
I have a PHP source file where "" characters automatically got added in! I don't know where they came from. I'm not getting any parse errors, but it results in weird behavior when the file is executed, e.g. the header() location redirect sometimes doesn't work! I'm curious how these kinds of symbols get auto-generated. I'm using UTF-8 encoding, and the "" sign doesn't show in Notepad++ or Windows Notepad, but it does in the NetBeans IDE.
E.g. this code:
<?php
echo "no errors!";
header("Location: http://stackoverflow.com");
exit;
?>
What is this? How can I prevent it?
You probably saved the files as UTF-8 with BOM. You should save them as UTF-8 without BOM.
It's called a Byte Order Mark, and it doesn't always have to be "": http://en.wikipedia.org/wiki/Byte_order_mark
Some Windows applications add a BOM by default. In Notepad++ you can use the Encoding menu options Encode in UTF-8 without BOM or Convert to UTF-8 without BOM.
I believe it still happens whether you save it as UTF-8 with or without BOM. I don't think it makes a difference.
Try it, see if it helps.
From a tool like vi or vim, you can modify and save the file without a BOM with the following two commands:
:setlocal nobomb
and then
:w
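If you'd rather check and fix this from the shell, a sketch assuming GNU sed and xxd are available (file.php is a placeholder name):

head -c 3 file.php | xxd                 # prints "efbb bf" if a UTF-8 BOM is present
sed -i '1s/^\xEF\xBB\xBF//' file.php     # strips the BOM in place (GNU sed syntax)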
We have created an Outlook Plugin which (amongst other things) can be used to save Mail items in text form to a specific folder. However, the text of the resulting text file is encoded in ANSI and I would like to save it as UTF8. I have already set the Codepage of the mail item like so:
mail = (MailItem)objItem;
mail.InternetCodepage = 65001; // equals UTF-8 encoding; see http://msdn.microsoft.com/en-us/library/office/ff860730.aspx
mail.SaveAs(filePath, olSaveAsType);
However, the resulting file is saved as "ANSI as UTF8", and all extended characters (e.g. Arabic or Russian) come out as question marks.
Does anyone know how I can save the mail item in UTF-8?
Thanks a lot.
Cheers,
Martin
Instead of trying to set the encoding, try reading InternetCodepage and then using a System.Text.Encoding object to read the saved file into a string. You can then convert the string and re-save it as another file in the encoding you prefer.
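If post-processing the saved file outside the plugin is acceptable, the conversion itself can be sketched on the command line with iconv, assuming the file really comes out as Windows-1252 (both the codepage and the file name here are assumptions):

iconv -f windows-1252 -t utf-8 mail.txt > mail-utf8.txt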
I have a character-encoding issue.
I have a text file written in Arabic; when I open it, I get weird characters like this: åÇÜÇáÍÌÑäÇáÑÝÇÚíãäí
Is there any way to fix this and get the correct text? The text file it came from is supposedly UTF-8 encoded.
As noted in the comment: it is not UTF-8, it is WINDOWS-1256 encoding, so you can repair it on Linux using the iconv command on a file named test:
jh@jh-aspire:4804~$ iconv -fwindows-1256 -tutf8 test
هاـالحجرنالرفاعيمني
(I have no idea what it means as I don't know Arabic)
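If you are not sure what the source encoding is, tools like file or uchardet can give a first guess (a sketch; uchardet may need to be installed separately):

file test
uchardet test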
You can use Notepad++ to display the Arabic text.
I have a problem with the encoding of a text file.
If I open it with *nix terminal tools like less, cat, or more, accented characters are shown correctly.
But if I open it with any editor (e.g. vim), accented characters are scrambled.
My terminal locale is set to UTF-8 and my editor (vim) has its default encoding set to UTF-8. If I open textfile.txt with vim, I see scrambled accents whether I set vim's encoding to UTF-8 or ISO-8859-1.
The output of the file utility is:
$ file textfile.txt
textfile.txt: ISO-8859 English text, with very long lines
I already tried the following with iconv:
iconv -f iso-8859-1 -t utf-8 textfile.txt > textfile.utf8.txt
I get this
$ file textfile.utf8.txt
textfile.utf8.txt: UTF-8 Unicode English text, with very long lines
Opening it with vim still shows scrambled accents, and this time the accents are scrambled even if I use cat or more.
My goal is to get this file in UTF-8 format and, obviously, showing correctly the accented characters.
[The brute-force way to do this is to copy every single output screen of the more command and paste it into an editor. There must be a smarter way to do this.]
Thanks for any help.
It turned out that the file contained characters from two different encodings; that's why the display was scrambled in every case and iconv didn't manage to convert the file cleanly. Thanks everyone anyway.
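For anyone hitting the same problem: once most of the file is UTF-8, GNU grep can locate the lines that still aren't valid UTF-8, because in a UTF-8 locale "." only matches valid sequences (a sketch; -n adds line numbers):

grep -naxv '.*' textfile.utf8.txt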