Can someone please give me the trained data file for the IAM handwriting database, for use with Tesseract? I need the .traineddata file.
Related
Could someone please guide me on how to extract a .docx file and load it into a database using an ETL (Extract-Transform-Load) or ELT (Extract-Load-Transform) tool?
Assuming that the .docx file contains mostly unstructured data, shouldn't I go for an ELT tool rather than an ETL tool?
The ETL and ELT tools I have found so far don't support MS Word as a source. What other way is there to extract the content of a .docx file and store it in a database?
My requirement is to:
Extract the data inside the .docx file,
Convert them into meaningful data, and
Store them in a data lake so I can perform data analysis and make decisions based on the results.
It's just like how e-commerce companies convert customer reviews into meaningful data so they can make decisions that boost their sales. In my case, it's Word files I need to analyze.
I'm asking this because I've searched for so many ETL and ELT tools but couldn't find anything that supported Word files. Maybe it's because I haven't been searching for the right tool or the right way to do it?
If somebody knows a way, please guide me through the process. What should I start looking for? A tool, or a way to code the entire thing?
I've been looking for an answer for weeks now but didn't find a helpful answer. And it's starting to get really frustrating to see all the tools supporting every other component like social media, MongoDB, or whatever EXCEPT Word files.
You have to do this in 2 steps:
Extract the data from the .docx file to txt or xml.
Then use SSIS to import it (or Azure Data Factory if you are in the cloud).
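If you would rather script the first step, here is a minimal Python sketch. It relies on the fact that a .docx file is a ZIP archive whose main text lives in `word/document.xml`. The function names, the `docx_paragraphs` table, and the use of SQLite are my own assumptions for illustration; SQLite only stands in for whatever database or data lake you actually load into, and the sketch ignores tables, headers, footnotes, and images.

```python
import sqlite3
import xml.etree.ElementTree as ET
import zipfile

# XML namespace used by WordprocessingML elements such as <w:p> and <w:t>.
W_NS = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"


def docx_to_paragraphs(path_or_file):
    """Return the text of each non-empty paragraph in a .docx file."""
    with zipfile.ZipFile(path_or_file) as archive:
        xml_bytes = archive.read("word/document.xml")
    root = ET.fromstring(xml_bytes)
    paragraphs = []
    for para in root.iter(W_NS + "p"):
        # A paragraph is split into runs; join the text of every <w:t> node.
        text = "".join(node.text or "" for node in para.iter(W_NS + "t"))
        if text.strip():
            paragraphs.append(text)
    return paragraphs


def load_paragraphs(conn, source_name, paragraphs):
    """Store paragraphs in a hypothetical staging table (one row each)."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS docx_paragraphs ("
        "source TEXT, position INTEGER, body TEXT)"
    )
    conn.executemany(
        "INSERT INTO docx_paragraphs VALUES (?, ?, ?)",
        [(source_name, i, text) for i, text in enumerate(paragraphs)],
    )
    conn.commit()
```

Usage would look roughly like `load_paragraphs(sqlite3.connect("staging.db"), "report.docx", docx_to_paragraphs("report.docx"))`, where both file names are placeholders. Once the raw text is in a table, the "transform" part (cleaning, tagging, sentiment, and so on) can run on it, which is closer to ELT than ETL.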
Installing and using any trainer for tesseract
I have been searching for three months for a way to train Tesseract and create a language file. I am not a professional programmer, so I am still learning. I need this to build an automated solution for a project, but I have not found any tutorial video or topic that explains how to install and use a training extension or tool. I am using Spyder and Python 3, and I have downloaded qt, tests, and other packages, but I do not know how to use them.
I need a tutorial or any documentation that can help with things like creating box files, exporting, and testing my file as a language.
You can use JTessBoxEditor.
1) Provide the image.
2) Generate boxes.
3) Train your Tesseract with the generated boxes.
These are the steps.
You can also train using a txt file. Here is a tutorial about it:
https://www.youtube.com/watch?v=i_1-hGsXxy8
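Since you are working in Python, it may help to see what the "boxes" from step 2 actually are. As far as I know, a Tesseract box file is plain text with one line per symbol, in the form `symbol left bottom right top page`, with coordinates measured from the bottom-left corner of the image. The sketch below assumes that six-field layout; the `Box` type and both function names are made up for illustration and are not part of Tesseract or JTessBoxEditor.

```python
from typing import List, NamedTuple, Tuple


class Box(NamedTuple):
    symbol: str
    left: int
    bottom: int
    right: int
    top: int
    page: int


def parse_box_file(text: str) -> List[Box]:
    """Parse box-file text, assuming 'symbol left bottom right top page' lines."""
    boxes = []
    for line in text.splitlines():
        stripped = line.strip()
        if not stripped:
            continue  # skip blank lines
        # Split from the right so the five numeric fields are always found.
        symbol, left, bottom, right, top, page = stripped.rsplit(None, 5)
        boxes.append(
            Box(symbol, int(left), int(bottom), int(right), int(top), int(page))
        )
    return boxes


def to_top_left(box: Box, image_height: int) -> Tuple[int, int, int, int]:
    """Convert to (left, upper, right, lower) with a top-left origin."""
    return (box.left, image_height - box.top, box.right, image_height - box.bottom)
```

Reading a generated box file this way lets you check that each symbol is labelled correctly before you train, and `to_top_left` gives rectangles in the convention most imaging libraries use for cropping or drawing.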
I want to add a new font to my language trained data file. I can build a new trained data file, and it works fine but is there any way that I could expand the old trained data file with the new one?
I have seen that someone seems to do this in the script below, but I cannot check it since I am on Windows:
https://github.com/CavemanNinja/shell_scripts/blob/master/traintess.sh
I have a large ERD diagram done in MySQL Workbench and I also have a large format printer. Does anyone know of a way to convert this project to a JPG so that I can print it?
Thank you.
Open your ERD and go to File->Export. There you can export the whole model to PNG or SVG, which you can later convert to JPEG if you want.
I have an application that exports data to a CSV file, which is stored in the Documents directory.
I have also enabled File Sharing with iTunes in my app, so that whenever I connect my iPhone to iTunes I can save the CSV file. But my data needs security, so I want to set a password on the CSV file, so that whenever the file is opened on a computer it asks for the password. How can I generate a password-protected file on the iPhone?
Well, you would have to create a disk image (.dmg file) in order to have the file password protected once it was on your computer. I don't think this is possible in iOS. I think your only solution would be to use an encryption/decryption algorithm. Store encrypted data into the file, and use your algorithm to decrypt the file once it is on your desktop. Here is a solution for encrypting/decrypting NSData.
How about doing it this way:
Use some generic encryption algorithm.
Either generate a key or ask the user for one (as per your needs) while generating the CSV.
Then, when it is copied to the desktop, ask the user to download a utility to decrypt it.
Or create a simple online decryptor in PHP, which should be simple enough.
Or use an existing one: http://www.tools4noobs.com/online_tools/decrypt/
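To make the "generate a key or ask the user" step concrete, here is a small sketch of both options. It is written in Python only for illustration (on the iPhone you would use the platform's own crypto APIs), and it covers only the key, not the encryption itself. The function names, the salt size, and the iteration count are my assumptions, not recommendations from any standard.

```python
import hashlib
import secrets


def generate_key(num_bytes=32):
    """Option 1: generate a random key (32 bytes suits a 256-bit cipher)."""
    return secrets.token_bytes(num_bytes)


def derive_key(password, salt=None, iterations=200_000):
    """Option 2: derive a key from a user-supplied password with PBKDF2.

    Returns (key, salt). The salt is not secret, but it must be stored
    alongside the encrypted file so the same key can be derived again
    on the desktop.
    """
    if salt is None:
        salt = secrets.token_bytes(16)
    key = hashlib.pbkdf2_hmac(
        "sha256", password.encode("utf-8"), salt, iterations, dklen=32
    )
    return key, salt
```

The desktop decryptor would call `derive_key` with the same password and the stored salt to get the same key back. A password-derived key is usually the better fit here, because the user only has to remember the password rather than transfer a key file.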