languages supported by lucene.net - lucene.net

-What all languages supported by Lucene.net?
-For eg. chinese,russian,japanese,arabic,french,german etc.
-If not, is there is any way to index other languages?

Lucene is not limited to English, nor any other language. To index text properly, you need to use an Analyzer appropriate for the language of the text you are indexing. Lucene's default Analyzers work well for English. There are a number of other Analyzers in Lucene Sandbox, including those for Chinese, Japanese, and Korean.
Here is a list of analyzer which you can integrated with your Lucene.NET:
https://svn.apache.org/repos/asf/incubator/lucene.net/trunk/src/contrib/Analyzers/
As you can see above lots of language are already included in the analyzers and if not included, go ahead and create it because if it is not listed only means, no one tried it yet, however it does not mean it is not supported.
Also read this SO discussion on Indexing multi-lingual content with Lucene.net

Here is the latest version.
https://lucene.apache.org/core/8_3_1/analyzers-common/index.html
Google the latest version of Lucene and change the version number in the link for the latest versions in time of your search.

Related

How to check if the grammar or syntax of a sentence is correct or not (simple grammar check in Swift)?

Hi I am working on testing a mobile app that can generate a description about objects or scenes. Is there a way to check if the sentence generated by the app has the correct grammar/syntax? I am using Swift.
No, there is no built-in support for checking the grammar of a sentence in Swift, nor am I aware of anything like that built into iOS. English grammar is quite complex, and then there's the issue of supporting multiple languages. It's a non-trivial problem.
You might be able to find a third party grammar checking library. I suggest googling "iOS grammar checking framework" or similar.

Does Confluence support Chinese language?

I am looking to know whether Confluence apps - surveys, documents, blogs etc. support Chinese characters.
thanks
As far as I know Chinese is not listed in default language packs. However, you can always install the language pack in Confluence. Please take a look at this document for further details.

Hebrew dictionary for PostgreSQL on Heroku?

Reading Heroku help on enabling full text search in PostgreSQL I see that it doesn't support Hebrew by default. Does anyone know how to add support for Hebrew dictionary in PostgreSQL on Heroku?
I work on Heroku Postgres, and would like input on this matter from those able to vend it.
I'm looking into this but so far the waters have been murky as to how Hebrew is supported in many/any open source projects, including dedicated full text searching projects like Lucene or Xapian. There are full blown toolchains for dealing with this, but their integration with PostgreSQL is not yet existent as far as I know, e.g. hebstem, hspell and libhspell, HebMorph.
If someone knows of what the current state of the art is for this in Postgres, I can try to make it work on Heroku at a time of my discretion, depending on the precise details of that implementation that I have to review somewhat carefully.
As-is my attempts to locate an ispell dictionary have been questionable, as is the efficacy of ispell style dictionaries for Hebrew given the reportedly very different stemming rules.
Related work:
Lucene Hebrew analyzer, which links to HebMorph
Xapan Hspell Integration, but it's not clear if this ever got fully fleshed out (at the time Xapian was working on extensibility in this area)
Thoughts?

Tools for manual translation of Constants/Messages .properties files

I'm looking for some tools that could be used by human translators during the process of translating our GWT application into other languages.
Currently, we have the English version of .properties files containing constants and messages, and need create the files for other languages. This tool should be easy to use, so even non-IT-lover can master it.
Or, do you suggest other method for translation of the texts?
I heard the "community" approach becomes quite popular, by that I mean that one uploads his texts to some (?) forum, and the community there creates the translations into other language - but as I said, I don't know much about this
Are there any online platforms for this purpose?
any other ideas?
See my SO answer for VB 6 source code, speech text is in french want to translate to english. The same answer works if you replace the computer langauge "VB6" by "JavaScript".

Does PostgreSQL have Good Multilanguage Support on Windows?

I am about to use postgres for my website but want to know if it has good multilanguage support before I spend time with it.
We are currently using PostgeSQL on multilingual site and using it's search compatibilities. It's pretty nice in fact. Text search, however required a little bit of configuration.
The fulltext search comes with the same set of builtin dictionaries as on Unix/Linux, which is pretty good coverage. If these cover your needs, you will have the exact same functionality.
If you require custom dictionaries, they also work equally well on Windows, but you are likely going to have a harder time building the installation of them there.
For fulltext search you can use the ispell dictionaries from http://wiki.services.openoffice.org/wiki/Dictionaries
The process of installation them into PostgreSQL is described here (unforunately in ukrainian, but if you are native with Russian or Linux :) you will understand this): http://grandse.org.ua/messages/show/60
http://developer.postgresql.org/pgdocs/postgres/textsearch-dictionaries.html
Here is the common docs for TSearch2: http://www.sai.msu.su/~megera/postgres/gist/tsearch/V2/