Converting UNICODE to shiftJIS [closed] - unicode

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
Closed 6 years ago.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
This question does not appear to be about a specific programming problem, a software algorithm, or software tools primarily used by programmers. If you believe the question would be on-topic on another Stack Exchange site, you can leave a comment to explain where the question may be able to be answered.
Improve this question
I have a Japanese client and have generated a large flat file (1.2 million rows) of data to send to them.
The file is UTF-8 encoded, which supports storing and displaying all the Japanese characters. The client wishes to receive this file in a shiftJIS encoded format that's designed for Japanese characters.
From the wikipedia page I can get the conversion logic
I see online converters such as motobit that let you convert encodings.
My issue is that my file is quite large and I will have to do this for several hundred more files repetitively. The copy-paste field on the online converter tool won't scale to that size and isn't quick enough.
Does anyone know of a free desktop application or perhaps even a ruby library that I could use to convert encodings? Or any other suggestions?
Thanks!

No need for any tool or utility, just use the gedit to convert your files.
Follows the steps mentioned below:
Open your file in gedit which you wish to convert, whatever format it may be in.
Copy all the contents of the file and paste in a new gedit document.
Now, save the file. In the save dialog thus opened select the character encoding as SHIFT_JIS before saving your file, attached is the screenshot below for this:
Change the line ending if you want to. If you do not see the SHIFT_JIS in the select options, then click on the Add or Remove button which is present just below it.
In the dialog thus opened, select the SHIFT_JIS from the available encodings in the left column and then click on the Add button. Once added to the encoding menu, select it and save it.

I guess what you want might be the nkf, Network Kanji Filter.
You can convert a file from utf-8 into shift-jis like this:
% nkf -s file-utf8.txt > file-sjis.txt
manual page:
http://linuxcommand.org/man_pages/nkf1.html
wikipedia:
http://en.wikipedia.org/wiki/Network_Kanji_Filter
You can install nkf like this:
% sudo yum install nkf
% sudo port install nkf
% brew install nkf
Hope this helps.

Related

Is there a PDF document of the current core Perl 5 version (eg. 5.34)? [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 1 year ago.
The community reviewed whether to reopen this question 1 year ago and left it closed:
Original close reason(s) were not resolved
Improve this question
This used to be a super simple no-brainer. For example, if I were looking for python documentation, I'd first do a google search for:
python documentation pdf
... and voila! The first result is exactly what's needed: A single downloadable containing PDFs for Python. There are 27 files, but one
of them documents Python core.
But I'm not looking for a comprehensive Python documentation. I'm looking for the same type of document, but for Perl. Doing the same google search for:
Perl documentation pdf
(or similar queries) would show references to either perl.org, perldoc.perl.org, perl.developpez.com, perl.mines-albi.fr, etc, but none gives a download link for a single comprehensive PDF that documents core Perl.
There used to be a downloadable large PDF file (built from a large manpage), which has disappeared since some 5.x version. I know that all the pieces of this core Perl document are available online in perl.org, perldoc.perl.org, and offline in perldoc. That's wonderful
for people who want to repeatedly manually query on a terminal or browser. I'm simply asking
where can I find a single PDF file that contains the entire core
Perl documentation, in the same way that there's one for:
Python (as mentioned above. See: https://docs.python.org/3/archives/python-3.9.7-docs-pdf-letter.zip, in the zip as "reference.pdf")
MySQL (https://downloads.mysql.com/docs/refman-8.0-en.pdf)
JavaScript/ECMAScript (https://www.ecma-international.org/wp-content/uploads/ECMA-262_12th_edition_june_2021.pdf)
Sorry if I'm missing something obvious.
PLEASE NOTE: I'm NOT looking for "book recommendations," as the SO review of my question had asserted (and subsequently labeled the question as inappropriate). I know the books. If I wanted
them, I would have bought them from Amazon.
You were looking for the current perl documentation and you got it.
perldoc.perl.org has it.
There's no single pdf, single html or single chm anymore (though various libraries to produce one).
perldoc for core perl consists of hundreds of documents, and with all the CPAN libraries you are at >100.000 documents. Plus there exist dozen or so perl books which are available as pdf.
There's no single perlbook, rather perlbook has a list of most books about perl.

Unicode sample text file for testing for Unicode related problems? [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 4 years ago.
Improve this question
I am looking for a sample text unicode file (UTF-8) that can be used for testing different problems related with text encoding and decoding including:
low ascii character usage, like first 32 codes
characters outside BMP
NFC related issues
XML encoding/decoding issues
Mainly I want to copy the text into clipboard, paste it in an HTML text-area of the application, and be able to retrieve it from a page after.
This would enable to identify different Unicode related problems that could occur at decoding, encoding or even database level.
This page has been used to test web browsers, with texts in several scripts: https://www.kermitproject.org/utf8.html
The Gothic entry for "I can eat glass" in particular is outside of BMP: 𐌼𐌰𐌲 πŒ²πŒ»πŒ΄πƒ πŒΉΜˆπ„πŒ°πŒ½, 𐌽𐌹 πŒΌπŒΉπƒ π…πŒΏ 𐌽𐌳𐌰𐌽 πŒ±π‚πŒΉπŒ²πŒ²πŒΉπŒΈ.
Normalization forms and XML processing are usually not problematic when moving data around, so there are no common samples that test those two in particular.

Documentation in md, pdf and html format [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 5 years ago.
Improve this question
I'm working on an opensource project on Github. I'd created some documents in Github's md format. However I want those documents available in three formats:
PDF: To be downloaded with project
HTML: To be hosted on my personal site.
Markdown (.md): For Github.
Obviously I'd not prefer to write them thrice. Is there any way I write it once anywhere (although MS Word preferred) and it could be converted to other two formats?
Consider Pandoc
I'd write in Markdown and convert to the others.
Another possibility is DITA. Its free reference implementation, the DITA Open Toolkit, lets you generate HTML and PDF out of the box, and can be customized to generate Markdown as well.
But, DITA might be a more sophisticated solution than you need, depending on your requirements for content reuse, the size of your docs, how frequently you update, whether you will be employing a technical writer to maintain and update your docs... It's a powerful solution better suited for a dedicated documentation effort than a one time ad-hoc situation.

looking for a good online WYSIWYG editor with RTF download capability [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 8 years ago.
Improve this question
Does anyone know of any JavaScript/CSS scripts that offer a good WYSIWYG editor, along with the ability to download inputted information to an RTF file?
The only one I've found so far is NicEdit, but I haven't been able the figure out how to convert the formatted text (which is in HTML format) to RTF and offer it for download. I've been able to store the formatted HTML in a JavaScript variable, but don't know how to proceed from there.
Do any of you know of an open source WYSIWYG text editor that offers users the ability to download their formatted text in RTF format?
In the end I decided to go for TinyMCE. It's pretty versatile and although downloading text was not a built-in function, it was easy to set it up myself just by passing the formatted HTML text through a PHP variable once a button was pushed for download.
From there, there are various php classes that can be used to convert HTML to RTF. Just updating this in case anyone else comes up with a similar problem in the future.

Online editor for word documents [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 4 years ago.
Improve this question
Can anyone recommend a really good online wysiwyg editor? I'm looking for the following features:
saving of document in a Word-compatible format (.doc, .docx, .rtf)
accurate cut/paste from Word
support of most Word features, including paragraph numbering
customizable. I need to add my own special handling for changes
Because the app will be used in a company intranet, it's not vital that it runs in all browsers.
have you tried office live or google docs?
OfficeLive
googledocs
Try Inetword
I found it is best for online Document editing, and does almost all functionality done by MS office.
You might wish to consider Native Documents (a commercial solution, new in 2018): https://www.nativedocuments.com which we have designed to be easy to embed. (Disclosure: I have an interest here)
saving of document in a Word-compatible format (.doc, .docx, .rtf)
Supports loading of doc and docx. (Not RTF yet). Save/export to docx or pdf.
support of most Word features, including paragraph numbering
High fidelity layout/rendering of Word documents is a key feature. Its good enough to be used for PDF Conversion.
customizable. I need to add my own special handling for changes
The web client is written in ReactJS, which you can customize.