VSCode pdf-editor extension - visual-studio-code

I would like to develop an extension to VSCode that would auto-detect *.pdf files (By PDF filename extension or by reading magic bytes)
The extension will then know the PDF file format and will allow the opening the "binary" pdf file and allow for simple text editing
What is the way in vs code to "auto-detect" file by types?
And, Once detected what is the proper way to open a binary file for editing purposes?

Related

Import video file into vscode

I have converted a m4v file type into a mp4 file type with https://cloudconvert.com. Currently in my Visual Studio Code set up I have assests/video/videoName.mp4. When I click on the mp4 file I get a message saying "The file is cannot be displayed in the editor because it is binary or uses unsupported text encoding". Any tips? I may be overlooking a small detail.
you need to download the MPEG-4 PREVIEW extension, and then it will open normally

Is it possible to write a binary file import extension for vs code?

I want to display some informations of a binary file in vs code.
Is it possible to write an extension for vs code, such that when selecting that file in the Explorer (or opening it directly) you see some text extracted from the binary file by that extension?
So the core functionality of that extension would be (simplified) a binary to text converter.
Any suggestions?
The VS Code team member has confirmed they do not have support for registering content providers for binary files in my issue.
I've inspected the workspace.onDidOpenTextDocument and window.onDidChangeActiveTextEditor APIs, but neither seems to be called when opening binary files.
Is there a way to display fallback content using registerTextDocumentContentProvider (or otherwise) for binary files?
That's why these types all carry Text in their names, TextEditor, TextDocument, etc. They can only handle textual, not binary data ;-)
No explanation as to why this works for PDFs, probably special-cased.

Programmatically convert Doc(x) files to PDF using Microsoft Word

We are developing a Java application that needs to programmatically convert .rtf, .doc and .docx files to PDF files.
Formatting is important to us, so we need the page numbers to be the same between a source file and a target PDF file, and the contents of each page being the same as the original file.
We have tried out open source solutions, such as JODConverter to invoke a LibreOffice of OpenOffice installation, Docx4j and XDocReport. The best formatting was achieved with LibreOffice. However, even in that case, the pages were different (for example, a 87-page .rtf file results in an 80-page PDF file).
So, we think that the ideal way to make the conversion would be to somehow invoke Microsoft Word though our Java application, and make the conversion with it. That would produce PDF files that have the same formatting as the original files.
Is this possible in any of the following ways:
An API that is directly invokeable through Java?
An API that is invokeable through a .Net language and we would use that with something like JACOB?
A 3rd party library that uses a Microsoft Word installation under the hood (something like JODConverter for Word)?
A CLI interface supported by Word (relevant question)?
Something else?

Identify docx pptx xlsx and other msoffice files correctly rather than applicaton/zip

Currently, im working on a perl script that should allow a file to be uploaded, and then process it, and then send it to a printer.
http://code.google.com/p/web-printer-http/
Apparently, both zip and microsoft office files ending in "...x" identify themselves as application/zip when using /usr/bin/file.
Any idea to differ them?
The differing should not be based on extension since the uploading system may be unabe to use extensions (like mobile phones and such).
Rather, it should judge the content entirely based on file content.
There's no way to distinguish between a .zip file and an Office Open XML file (like .docx, xslx, .pptx, etc) without peeking inside the zip.
All of the various open XML formats use .ZIP compression. Therefore they are zip files.
If the zipfile conforms to the Open Packaging Convention, then it may also be an Open XML (Office) document.
A good heuristic is if the zip file contains a file called /[Content_Types].xml, then it is an OPC file. To determine if the zip file contains a file of that name, you must read the zipfile directory. Use a zip library to do that.

how to read .rtf file from the FTP server?

I know how to read text file from the ftp server.I have used this logic for rtf file.But it gives wrong out put.Is it possible to read rtf file from the server?If possible then please help me.
You need software that understands the rtf file format. Microsoft Word does. Most plain text editors will not be able to show you the contents of a file in rtf format.
Google for rtf format
Added: didn't notice that you're on an iPhone platform. Google for "rtf iphone" and you'll see several choices including http://prmac.com/release-id-4180.htm