From the TinyMCE PowerPaste plugin documentation :
If you configure PowerPaste to allow local images (see the
powerpaste_allow_local_images setting below) then images pasted from
Microsoft Word and other sources will appear in TinyMCE as Base64
encoded images.
It's not clear what and other sources means. Does it include things copied over from a browser?
From https://stackoverflow.com/a/39842881/329660 I assume that an image pasted as part of a chunk of HTML can't be pasted as binary data in TinyMCE.
But if the user right clicks the image and "copy image" from their browser, is PowerPaste supposed to paste the full Base64 data in the editor (granted that the browser put the data in the correct format in the clipboard in the first place, of course)?
The "other sources" for this is typically just Excel. Word and Excel are the two office products that consistently place image binaries into the clipboard in RTF format that PowerPaste can extract.
If another office product did the same PowerPaste would see that and grab it as well but at present I don't know of another office product that does exactly what Word and Excel do with images.
Related
Native Language Characters are truncated when you start typing them or paste them in form fields using Adobe. If I open it on chrome, it works fine. Currently I am using iTextSharp to populate the data properly to the pdf form. But as soon as I click on the field, font changes and the data gets truncated. Screenshot is attached.
Note: Javascripts from PDF are already removed.
EDIT: PDF is available on https://gofile.io/?c=6nO7ul
Test Characters: 汉字汉
In our application we are using Itext Pdf 5.5.3 library.
We have checked with some of the pdfs in which Checkboxes displayed correctly(check/uncheck) .
However there are some pdf with RadioBoxes and do not display radiobutton(on/off) correctly.
I also use this link to validate pdfs and java code
String[] values = form.getAppearanceStates("Checkbox");
return null values.
Also tried Itext RUPS and found that pdf which are working shows Form Field Names in RUPS Form Tab. And PDfs which are not working do not display form fields.
I tried generating pdf from word document and it doesn't display form fields in RUP , neither I can check/uncheck checkbox in Adobe Acrobat Reader.
What could be the solution to display radiobutton with check on / off ?
Edit -
I had created sample web application to reproduce the issue.
Please setup attached web application and let me know the fix for the issue.
Please download from this link
You have successfully discovered the difference between interactive PDF forms and "flat" PDF documents that look like a form to the human eye, but that aren't interactive forms.
To make the "flat" forms interactive, you need to open those flat documents in PDF editing software (e.g. Adobe Acrobat) and you need to add a form field manually.
You can ask Acrobat to guess where it should add fields, but Acrobat will be wrong in many cases for obvious reasons. You always need a human if you want it to be done correctly.
As for creating an interactive PDF from Word... Forget about it. Use OpenOffice or LibreOffice.
I work for a publisher and am trying to extract content from our fully laid out PDFs. I've tried pdftohtml, pdftotext, pdfminer, and other Python-based approaches to getting the content, as well as saving to Word, HTML, XML, etc. from the original Acrobat files.
I don't need just the text, I also need the text formatting. That's because, for example, I need all the blue text in the document.
When I save to HTML, Word, etc. from Acrobat, the resulting files contain screenshots of the pages, not the laid out text. When I extract text using different Python modules I get the text but lose the text formatting.
The only solution I've found is to manually copy and paste from the PDF into a word doc, then saving as HTML. I'm hoping to automate this.
Why does copying from Acrobat into Word achieve what I can't do by other means? Has anybody come across this problem before?
Maybe you can consider another method. The software (https://pdfapi.codeplex.com/) can convert pdf files to html directly via MVS. If you are able to use the MVS, i think the software i mentioned above is useful for you to convert the text in pdf files to html that can keep the format perfectly. Of course, it's just a referral, you can have a try.
I know that forms in .pdf made by Adobe Acrobat can act like questionnaire or application blank for filling input fileds and saving results, but is it possible in a way to add more interactiviry to them
- add file upload or attachment to resulting document?
It's kind of possible.
You'll need to add Javascript code to your documents. The code should show Browse For File dialog and import selected file to the document. Please have a look at Import Named File Attachment script (at the bottom of the page). I doubt that this solution will work in Adobe Reader or alternative PDF viewers, but Acrobat Professional should do the trick.
Another solution is given is this thread on Adobe forums:
...I figured out a roundabout way to attach files in adobe reader (not
the full blow adobe acrobat reader). Via Adobe Acrobat 9 Pro I
created a blank form with the "comment/mark up" tools enabled. Then I
merged that file with the other PDF form that was formatted with
fields, radial buttons etc. I had to create the blank form for the
attachments separately because if I tried to enable the "comment/mark
up' tool on the form with fields and radial buttons it disabled the
fields. ... The comment markup tool then allowed me to attach files
via the "attach file as a comment" option.
I am creating a PHP generated report page (using a back-end DB for values), which contains images. This page is then copied and pasted to MS Word, where I run a few macros on it to generate some figures.
I used to use MS Word 2003 earlier and there was no problem with the copy of the page and images, but when I upgraded to MS Word 2010, the images do not get copied.
The PHP page is password protected and uses HTTPS, where I have to enter a Username and Password for the final report page to be displayed. In Word 2003, the user dialog used to appear, but it does not do the same for Word 2010.
Is there some setting in Word 2010, which I need to enable for the images to be shown and/or the user/password dialog prompt to appear?
One way which I have found is to copy the content to a New Outlook message window. This way all the images and text comes up, then copy from there onto the MS Word document. Not sure how and why, but both the images and text show up there, although the alignment is a bit skewed.
(I even tried un-checking the box for "Show Image Placeholders" in the Advance options of MS Word 2010, but still no effect.)