Are there free, low cost, or open source tools for matching name/address data? [closed] - data-quality

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
Questions asking us to recommend or find a tool, library or favorite off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam. Instead, describe the problem and what has been done so far to solve it.
Closed 9 years ago.
Improve this question
This question is related to Tools for matching name/address data. There is a number commercial tools provided by SAS, Oracle, Microsoft, etc., that allow to de-duplicate or merging names of individuals or companies coming from multiple sources.
However, after reading the answers to the question mentioned before, I wondered why a seemingly interesting problem didn't receive any answers mentioning open source projects that could tackle the problem.
Are you aware of any open source projects or algorithms to implement the so called "record linking", "record merging", or "clustering"?

I'd recommend Google Refine as an open source (New BSD license) tool for parsing and fixing crufty data. It also allows clustering and reconciling of duplicate data, as well as having data-mining features.
I've used it to import and fix a lot of data in various formats, .csv, .tsv, .xls, .xml, .json, .rdf etc. with success. It can be used in-house without sending any data externally, which seemed to be a concern of the question "tools for matching name/address data"
NB. Google Refine was previously called Freebase Gridworks.

I stumble upon the following article: "Merge/Purge and Duplicate Detection".
By looking at http://www.semaphorecorp.com I found some extremely low prices.
This is not what I'm looking for, but at least is a bit of help, and a step on the right direction.

Try OSDQ open source data quality and profiling project on sourceforge

Related

Is there any tool to organize development notes and ideas? [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
Questions asking us to recommend or find a tool, library or favorite off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam. Instead, describe the problem and what has been done so far to solve it.
Closed 8 years ago.
Improve this question
I have been searching for some tool to organize my development notes, ideas and features on diferente projects.
I saw diferente tools that did part or what I need but nome combines all.
I know I can do it, but dont have the time.
Right now I use different tools to manage:
By Project
By versions
Features
Note
requests
bugs
etc
Basic features I whould like:
Organized by project
By version
What is it ( Note, idea, feature request, bug, etc
Import/Priority
Deadline
some sort of tag to catalog it
Code-snippet, to better illustrate
Image to better illustrate
I'd like more, these whould do for now.
Does any one know of some tool that those this, or most of it ?
Thanks
We are using Jira. Jira is a bugtracker, issuetracker etc... You can combine it with many add-ons (for scrum, github functionality)

Need to know about the best Managed File Transfer tools available in market considering all the major constraints [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
Questions asking us to recommend or find a tool, library or favorite off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam. Instead, describe the problem and what has been done so far to solve it.
Closed 8 years ago.
Improve this question
Need to know about the best Managed File Transfer tools available (Open source or licensed) in market considering all the major constraints. Most importantly it should be used for enterprise integration with high availability. I have worked with some MFT tools but each has its own advantages and disadvantages.
I'm sure you've done the Google research on MFT solutions, so I will assume you're looking for personal opinions to validate the direction you're already headed.
I've evaluated Linoma, IpSwitch, Townsend, GlobalScape, Liaison, and Axway. I've got a comparison table somewhere that I put together for our analysis that outlined what I needed in a solution and found that Linoma best fit the bill.
I've now used Linoma's GoAnywhere MFT solution for a few years and am still very pleased with it. The company keeps up with the changing technologies, it has very powerful scripting capabilities yet keeps it simple to manage and administer. Key management, scheduling, alerts, logging, role management, HA integration, data manipulation, multi-platform, mobile app, and more are some of the many functions. But one of the best things about Linoma is their dedicated support team. They are the best I've worked with.
Here's a link to their information: http://www.goanywheremft.com/products/director
Good luck in your search.

Web Form Builder, Open Source, Drag and Drop [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
Questions asking us to recommend or find a tool, library or favorite off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam. Instead, describe the problem and what has been done so far to solve it.
Closed 9 years ago.
Improve this question
This is not a technical question, but I will try to be specific in order to this question to fit into the stackoverflow requirements.
I'm currently facing a task where I have to design hundreds of web forms, integrated in a java web project (JSP pages). I'm searching for a Web Form designer tool with the following requirements:
Drag and Drop interface (to speed up the process);
Open Source;
Database access (not mandatory but at the same time very usefull);
Java integration or Java based (again, not mandatory but at the same time very usefull);
Capable of exporting local files (the forms can not be alocated online, like in many form builders);
I wasn't able to find a tool to fit this requirements. Does anyone knows one?
Thanks
Unfortunately, no there isn't. You can find frameworks that acts as a wrapper and therefore it can save some development time, such as http://www.jformer.com/
There are a few proprietary applications around. Take a look at http://www.wufoo.com/
I've been looking for similar tool without any success. I am thinking about starting an open source project in these lines. But, I do not have anything concrete yet...

Suggestions needed on tools and technologies to use for building a Facebook-like website [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
Questions asking us to recommend or find a tool, library or favorite off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam. Instead, describe the problem and what has been done so far to solve it.
Closed 9 years ago.
Improve this question
With so many tools and technologies lying around, I am looking for suggestions around the best ones (UI/server-side frameworks/database/CMS) to use for building a web(site/app) similar to Facebook itself.
Details of the website cannot be revealed due to privacy concerns. But largely, the experience and interactions would be similar to what Facebook has (such as continuous feeds, groups, upload data/files, comments, etc.), just that it would be in a different domain.
Information (or links) on what technologies/frameworks are such sites/portals using will also be of great help!
Elgg is a great start. they have numerous plugins (some that even make it look very similar to facebook). I've seen some prototypes that where built in a few days that have tons of functionality
The simple answer is PHP. But people likely imagine a LAMP stack.
Facebook has reengineered the front side and back side of PHP, as I understand it. They use the HipHop compiler to cut the cost of execution of PHP. And I don't know the details, but they have some kind of backside distributed database they use instead of PHP/LAMP traditional use of MySQL.
(See http://www.facebook.com/note.php?note_id=24413138919 for a description of one of the mechanisms they use, Cassandra).
If you don't care about scale, you can skip these two steps and save a lot of engineering.

deployable wiki-based documentation [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
Questions asking us to recommend or find a tool, library or favorite off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam. Instead, describe the problem and what has been done so far to solve it.
Closed 9 years ago.
Improve this question
I am looking for a wiki project that is editable by developers and can have comments and history, much like everything else, but also has the following features:
A way to tag or version the wiki in an intuitive interface that any competent developer can use
A way to deploy a tagged or versioned snapshot of the wiki with the option of stripping it of any editorial history.
The use case is to have a team of developers able to fluidly update documentation in the lifecycle of a project and have the necessary internal dialogs, but then have a way to package the documentation in a polished way so that it can be included with a commercial product.
The ideal solution, if this software exists somewhere, would be to have some type of facility so that you can do say, PDF output to send to a commercial printer or have a way to do custom templates depending on the parameters of the deployment.
Does any sage developer out there know of such software?
I would take a look at GitHub's Gollum which seems to fit your requirements quite well. They also support a bunch of different markup alternatives, and both Markdown and Textile have converters to PDF (and probably a bunch of the other markup choices as well).