I want to write an Android and/or iPhone app that takes a picture of something (for now, I want to limit it to text) and then parses the text to make use of it. For example, a picture of a sentence (or maybe just a fragment) from a book could be parsed by the app to bring up more information about the book: title, author, ISBN, etc., and maybe even information about other books with similar content.
Is it possible to do something like this? Is there an existing API that parses the content of an image? How is an image stored on Android and iPhone? Is it possible to implement the app on one platform and not the other?
I'd appreciate any input or advice that you guys have to offer. Thank you!
You're looking for this, possibly.
It's called OCR, or Optical Character Recognition.
Also check out ZXing, a great library for decoding one- and two-dimensional barcodes. There are both iPhone and Android versions.
Related
I am trying to find a solution for this AR app, as the title says.
I want my app to recognize a number hand-written by the user.
The app will tell the user to write down, for example, the number 24 on paper and then move the camera over the written number to see the 3D object.
This might be used for saving a birthday, a wedding date, etc.
For accuracy, the app's instructions will show the user a preview that says: please write the number 24 similar to this.
Handwriting will still differ from person to person, but at least we won't get curly "2"s or a "4" with an open edge, etc.
So here we need AR to recognize the number, or at least to read it by approximation.
The first question is: is such behavior doable, or is anyone familiar with a similar concept?
After searching for similar apps, I found the "Ink Hunter" apps for tattoo previews. These apps use symbols, not numbers, but we can think of a number as a symbol as well.
Also, as this video shows (https://www.youtube.com/watch?v=9rXJcIE2Fcs), each user draws the symbol in a different way and it still works.
I am using Unity3d and Vuforia.
Vuforia offers free samples (Unity3d packages) on its website, and there is one named "Text Recognition". Here's the tutorial link: https://www.youtube.com/watch?v=W3MK6nC5FWE
Unfortunately, I couldn't make it work.
If someone has developed such functionality using these sample projects from Vuforia, or has an alternative method, I'd appreciate your help :)
Thanks in advance, moghes
Here's a tutorial our team created on text recognition using the HoloLens and Vuforia with Unity: https://www.youtube.com/watch?v=WdMeHgD4fMY. In the first portion of the video, we show how to get text recognition working with just Vuforia and Unity, no HoloLens required. For your application, just change the text to numbers.
I believe the biggest challenge you will have is the "hand-written" component. From our research, Vuforia prefers computer-generated, predefined font types.
I am working on an app that needs to translate the text in an image in real time using the iPhone camera. Is there any way to implement this? Any SDK or tutorial would be helpful.
My suggestion would be a combination of the following:
The open-source Tesseract OCR engine for getting the text from the image (a fairly recent iOS wrapper is here: https://github.com/ldiqual/tesseract-ios)
One of the translation services discussed in this question for the translations: https://stackoverflow.com/questions/6151668/alternative-to-google-translate-api
A tutorial like this one on how to get a real-time camera view with overlays: http://www.musicalgeometry.com/?p=1273
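The three pieces above can be wired together roughly as follows. This is only a sketch of the data flow in Python, not iOS code: `TextRegion`, `translate_frame`, and the `recognize` and `translate` callables are hypothetical names standing in for whichever OCR engine and translation service you pick.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

# (x, y, width, height) of a piece of text inside the camera frame.
Box = Tuple[int, int, int, int]


@dataclass
class TextRegion:
    text: str
    box: Box


def translate_frame(
    frame: object,
    recognize: Callable[[object], List[TextRegion]],
    translate: Callable[[str], str],
) -> List[TextRegion]:
    """Run OCR on one frame and return translated overlays to draw.

    `recognize` stands in for the OCR step and `translate` for the
    translation service; both are assumptions supplied by the caller.
    """
    overlays = []
    for region in recognize(frame):
        # Skip regions where the OCR step produced only whitespace.
        if not region.text.strip():
            continue
        # Keep the original box so the overlay sits on top of the source text.
        overlays.append(TextRegion(translate(region.text), region.box))
    return overlays
```

Keeping the OCR and translation steps as plain callables means either one can be swapped out (or replaced by a stub while you work on the camera overlay) without touching the rest.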
Please note that these are just ideas for getting something working as quickly as possible.
Some apps that offer real-time translation even try to find a suitable font and display the translated text at the exact position of the original text. I am afraid that this is not possible without investing lots of work and developing your own OCR engine.
Hope this helps.
I have used the library for reading a barcode from an image, but I can't find any source or help for reading numbers from an image.
I just need to take a photo with the camera, then process the image to check whether it contains any numbers and read them out.
Is there any way to get the numbers written in an image?
That's an OCR type of problem. You might want to take a look at: "Is there any good OCR API for iPhone and/or Android phones?" or "iPhone: Regarding OCR technology?"
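Once an OCR engine has given you a string, picking out the numbers is ordinary text processing. A minimal sketch in Python, assuming the OCR output is already available as a string (the function name `extract_numbers` and the pattern are illustrative and not part of any OCR library):

```python
import re
from typing import List

# Matches runs of digits, optionally followed by a decimal part such as ".95".
NUMBER_PATTERN = re.compile(r"\d+(?:\.\d+)?")


def extract_numbers(ocr_text: str) -> List[str]:
    """Return every number found in the recognized text, in reading order."""
    return NUMBER_PATTERN.findall(ocr_text)
```

The numbers are returned as strings on purpose, so that leading zeros (as in codes or phone numbers) survive. Real OCR output often confuses characters such as "O" and "0", so expect to add some cleanup before this step.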
The Google Docs API has some OCR support. You may want to take a look at this and see if you can implement it: http://googlesystem.blogspot.com/2009/09/google-docs-ocr.html I don't remember if Apple will allow you to use any external libraries though.
I wanted to ask how barcode recognition apps (like Goggles) work.
Where do they get the barcode information?
There are a variety of standard formats for barcodes. Typically, a barcode app will (somehow) apply some image processing to determine which kind of barcode it's looking at, and then interpret the image accordingly. Figuring out which kind of barcode boils down to making an educated guess based on some set of heuristics.
Once you know which kind of barcode you're working with, it's much easier to process it.
See also: QR Code#Standards.
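As one concrete example of such a heuristic, applied after the bars have already been turned into digits: the common retail formats differ in length and all end in a check digit, so length plus checksum is a reasonable first guess. This Python sketch is an illustration only, not how any particular app or library does it; `guess_format` and `gs1_check_digit` are made-up names, and only EAN-8, UPC-A, and EAN-13 are covered.

```python
from typing import Optional

# Digit counts (including the check digit) of three common retail formats.
FORMATS_BY_LENGTH = {8: "EAN-8", 12: "UPC-A", 13: "EAN-13"}


def gs1_check_digit(payload: str) -> int:
    """Compute the check digit for the digits that precede it."""
    total = 0
    # Weights alternate 3, 1, 3, ... starting from the digit next to the check digit.
    for position, char in enumerate(reversed(payload)):
        weight = 3 if position % 2 == 0 else 1
        total += int(char) * weight
    return (10 - total % 10) % 10


def guess_format(digits: str) -> Optional[str]:
    """Guess the barcode format from decoded digits, or return None."""
    if not (digits.isascii() and digits.isdigit()):
        return None
    name = FORMATS_BY_LENGTH.get(len(digits))
    if name is None:
        return None
    # A wrong check digit usually means a misread, so reject the guess.
    if gs1_check_digit(digits[:-1]) != int(digits[-1]):
        return None
    return name
```

For instance, `guess_format("9780306406157")` returns `"EAN-13"`, while the same digits with the last one changed return `None`.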
Google Goggles and Google Shopper use a (modified) version of the ZXing library for barcode scanning. Not surprising as it is all from Google. The barcode data comes from internal sources and from sites on the web.
Just a quick question on the iPhone technology within this business card reader:
http://www.youtube.com/watch?v=F8z6pcxdrPo
As we can see, this video shows users taking a photo of a business card. I have an idea where I would take a photo of some text, and that photo could then be turned into text on the iPhone. How would I be able to implement this using the iOS API?
Cheers, guys
The camera stuff is all standard: use UIImagePickerController for this.
Text recognition (OCR) is not a built-in part of the iOS API, though, so that part really isn't trivial. There are multiple open-source projects that can handle this sort of thing if you want to go after them.
Tesseract is an older but possibly viable one. Check out this post which has info on cross compiling it for iOS.
Other users here might have more current recommendations.