I am looking for large nfc type 2 tags - tags

I am looking to buy large nfc forum type 2 tags. At least 512 bytes, but preferably the largest 2040 byte tags that are theoretically possible according to the Nfc Forum Tag Type 2 specification. Does anybody know where I can purchase these large tags?

I'd suggest buying a Mifare Classic 4K tag or sticker.
Search for "Mifare 4K" on Google. Any contactless card or sticker you find should work, at least for testing purposes.

Related

Where can I find a lot of data to feed it an artificial intelligence?

I want to code an artificial intelligence. To teach her the language I can use Wikipedia offline but for teaching her communication I need other sources. Do you know big data sources which fit to this task and are free available? For example chat protocolls, mails, content of forums or something similar?
for teaching her communication I need other sources.
Some ideas:
Search video transcriptions on youtube (you may need to edit them for quality)
Search in your country political debates transcriptions (they maybe available for free on internet)
Search for theater plays dialogs in public domain

converting pdf to txt

I am trying to convert pdf's to text of the Flint water crisis emails from Gov. Snyder. Basically they have 20k+ pages of emails printed from Outlook and then scanned in as .pdf's. (Obnoxious, I know.) I have tried various tools like Tesseract (both directly and after having converted the .pdf to .tif with ImageMagik) and I just get a bunch of gobbledeegook.
Does anyone have any other suggestions for how to deal with these files? I am able to open them in Acrobat Reader and copy out all of the text, but the result is poorly and inconsistently formatted, making writing one script to clean it up very challenging.
Thanks in advance!
The quality of OCR directly depends on image quality, document formatting and layout, and the quality and proper configuration of the OCR technology you are using. As the complexity of the document increases, usually you will be steering away from free OCR to more powerful commercial solutions to achieve higher OCR results. IF you require formatting preservation, that alone exists only in a few commercial OCR applications. Any one of the major OCR providers is your answer.
Consider using OCR-IT web-based API (www.ocr-it.com) for this conversion. /one of the highest quality OCR on the market. I am one of original developers of that system, and our goal was to achieve top quality on the market.
Also, if this conversion is for a good cause and for people's benefit, by a non-profit organization, a non-commercial project, or just a goodwill personal project, my friends and I want to help. We volunteer and offer large-volume conversion for free. We contribute our skills and high-quality OCR software in return for a non-monetary compensation, such as a mention in your project, share about us to your circle, spreading the word about our goodwill, etc.

Mifare Plus reduced read/write range

I am using Mifare Plus cards instead of Mifare Classic and communications fail. It looks as if the Plus cards must be nearer to the reader than the Classic cards. Is this a known problem and are there any work-arounds short of ripping out existing equipment?

Resources for Scantron Cognition Enterprise?

I am using Scantron Cognition Enterprise at work to capture data from scanned forms. Building these forms is tedious at best, especially when it would be nice to have a library of pre-built objects to use. Unfortunately, documentation and on-line resources are scarce.
Does anyone have any pointers to find some resources for this tool?
Hey Jason, believe it or not, Scantron is STILL the standard, but this is not the Scantron you probably remember. Although OMR (bubble) forms are still used extensively in education, there are a lot more advanced technologies available to be added to them today.
Concerning Cognition, I looked through the available tags and these would fit:
"document-imaging" - Cognition is a document imaging product and can feed images and index values into most commercially available document storage applications
"OCR" - Optical Character Recognition, or reading machine print.
"ICR" - Intelligent Character Recognition - reading hand writing, usually in a constrained print format (one letter per box like a credt card application.
"datacollection" - the key purpose of Cognition is data collection.
However, there is not a tag for "OMR" - Optical Mark Recognition, or reading bubble choices, similar to the basic Scantron forms of the past. Also, I could not find one for "Key From Image", another purpose that Cognition is used for.
I am a Cognition user as well as someone who markets it and I know that there are a large number of users in North America. Many corporations that use Cognition use it for sensitive HR functions and so might not have their usage of it posted in a searchable format. Many other organizations use it for safety inspections, insurance data entry, and also for testing and surveys - basically anywhere you have a large number of paper forms and need all of the data quickly entered into a database. Many users are using Cognition for sensitive applications are so are not likely to share, but I can share a few I have, you could also contact your Scantron rep and they might have something they could share as well. I have some decent ICR fields built for name, e-mail, address, etc. The ICR fields are best when you build in your own dictionary or database look-ups. The OMR fields are the hard ones to build, but I have a few of these as well. The easiest way to share these is to send you the form that already has the field built into it. You can build your own lookups from txt, xls or db files.

Auto Categorization of Content

I'm developing a script that extracts the messages from the message archive of a particular meetup.com group of which I'm a member - http://www.meetup.com/opencoffee/messages/archive/
The idea is to dynamically add these to a wordpress site and allow people to search messages, auto tag messages etc.
The issue I have is how best to auto categorize these messages. I would welcome any thoughts and ideas of how best to go about this and what would be the most efficient way of programming this.
Option 1
Find a source of tags by subject area such as finance, technology, business etc by using the delicious API and find related tags by subject:-
http://delicious.com/tag/finance
http://delicious.com/tag/technology
if a message contains these tags then the message is assigned to the respective category.
I believe this could work but not sure the most efficient method of scanning the message for these tags.
Option 2
Find sites that are representative of the categories I need such as ft.com, the economist for finance etc, techcrunch for technology etc and then determine what tags are being used by people to tag these sites and determine by default that those tags are how people relate to these sites and their content stack.
Option 3
Pass the message url to http://semanticproxy.com/ (part of Reuters Calais project) or use the Open Calais API. This I have tried but without much success as the variable depth of content is not always sufficient to return meaningful taxonomy.
Here is an example message that I parsed through the calais api:-
Original Message
http://www.meetup.com/opencoffee/messages/6045615/
Calais Result
http://www.mashinteractive.com/opencoffee/calais.php
SUMMARY
So That's about it. I would welcome any thoughts and ideas on methodology and tips on how best to approach the message scanning for options 1 and 2.
FYI there are approximately, 1,700 messages to date and I'm guessing I may have 10 categories with each category being defined by 20 or 30 tags.
If anyone would like to help develop a Wordpress plugin or class to do this I would be more than happy to have you on board. Bear in mind I'm not a programmer, I just tinker around the edges and pretend I am one.
Thanks in advance
Jonathan
CEO
Crowd People
You may want to check out Zemanta, which has tools and plugins (including Wordpress) for auto-tagging content, and also have a look at Common Tag, which is a vocabulary for expressing tags on content using RDFa, a semantic web standard currently indexed by some search engines.