Searching in Lucene .Net - lucene.net

I have used Lucene .Net for Indexing and using StandardAnalyzer to at time of Indexing. Now I want to search say 'attach'. In document 'attached' is there. How i get the successful hit for word 'attach'. Please help me as soon as possible.

Reducing word forms to their roots is called "stemming" in search engine software. Look at the first answer to How to enable stemming when searching using lucene.net? for a few options for stemmers using Lucene.net.

Related

Fuzzy search on MongoDB using Java

can you please suggest if fuzzy search is possible on MongoDB using Java ? I could do exact search after creating a text index but need to implement fuzzy search. e.g. if I search for app, I should get both app and apple. I'm not using Mongo Atlas.

How to implement full text search in NoSQL database?

Has anyone tried to deploy NoSQL database with Full Text Search feature?
I read a lot of topics here in StackOverFlow and some other sites but they were all in 2011 and 2012 which I think there are a lot of updates to this moment.
I have a project that requires a full text search feature and I am trying to pick the right NoSQL database.
I am thinking also of ElasticSearch and Solr to enable this feature?
Is MongoDB Full Text Search Feature working fine? or it has performance and scalability issues?
Thanks,
Elasticsearch is working pretty good. You can use analyzers to stem your text. Also you can store your data in JSON format. It has "match_phrase" function allowing you to make full text search in a field that you want to search. Take a look at that.You can find documentations here : https://www.elastic.co/guide/en/elasticsearch/reference/current/search-search.html
The MongoDb full text search feature works well from version 2.6, however, its full text search feature relies on a text index and you can only add one such index per collection. Depending on your exact requirements, this may be too limiting. Check the MongoDB documentation for more details. If you need a more flexible full text search functionality, Elasticsearch will be a better option.

Google custom search engine and partial matching

I plugged in the Google Custom Search Engine to my MediaWiki site. It seems to work fine. However, how do I also make it search for results using partial matching? For example: when I searched for 'loft', it returned only the pages containing the whole word 'loft', but I was also looking for the pages containing 'loft' as a substring of some words, like 'createloft', 'deleteloft', 'loftstudy', etc.
Google doesn't provide such advanced search features. If you need things like per-namespace search, substring matching, regex search etc. use CirrusSearch, which is based on ElasticSearch.

Is there some search engine SDK or API that I can use for local search?

I have many documents located on my disk and I want to build a search engine to search through them.
I know Google Desktop Search or Bing Desktop Search could do that. But I want to know if there's some SDK/API to do that so I can do some customization.
What I want to achieve, is that I can provide a document and the local search engine will return all the documents similar to it.
In general there are Lucene and Solr that can help to solve search related needs in Java (I guess you are using Java based on the tag GWT).
But I don't know how to do a search by example with these tools. I think you have to extract the relevant information of the document to construct a search based on it.

Google like autosuggest with Solr

I'm currently using Solr with Terms Component and Jquery Autosuggest which works quiet good. However, this construct is limited to one autosuggest word (it autocompletes only the first word). Is it possible to implement a Google like autosuggest with multiple words/terms so i can autocomplete multiple words?
I just wrote a blog post about different ways to make auto complete suggestions with Solr. It's basically a comparison of some different strategies, check it out, it might help.
If you want to make multiple terms suggestions, it turns out you should use, as already mentioned in the other answer you got, the Suggester component available in Solr starting from the 3.1 version. Since it has some limitations, you can also have a look at the ShingleFilterFactory, which generates token NGrams. It creates combination of tokens as a single token, that's useful to suggest multiple words.
You can also use SpellCheckComponent for better auto complete suggests.
See http://wiki.apache.org/solr/Suggester for details.
Edit:
Refer here for solr5 and above
Check this out, it might help
http://www.lucidimagination.com/blog/2009/09/08/auto-suggest-from-popular-queries-using-edgengrams/
I worked out the same task, finally got solved using TermsComponent with multiple fields. check this link. http://lucene.472066.n3.nabble.com/auto-completion-search-with-solr-using-NGrams-in-SOLR-td3998559i20.html