Is it possible to index custom documents in Faceted Search (ke_search)? - typo3

My requirement is to index the content of documents uploaded by users, e.g. PDF, DOC, etc. Is it possible to index the content of these documents by building custom indexers for the Faceted Search extension ke_search? If so, can anyone provide a guide on how to create such indexers?
I am new to TYPO3, so any help would be appreciated.

Hello and welcome to TYPO3, the almost-everything-is-possible CMS :-)
For your own indexers you will need a small extension of your own, in which you register your indexers with ke_search. Since custom indexers are written in PHP, you can use the whole power of PHP. So you just need some PHP libraries on your server that can read the content of PDF and DOC files, and then store the result in the TYPO3 database.
Check the docs:
https://www.typo3-macher.de/facettierte-suche-ke-search/dokumentation/ein-eigener-indexer/
Example configuration with an indexer:
https://github.com/teaminmedias-pluswerk/ke_search_hooks
EDIT: You should also check the built-in indexers of ke_search. I guess an indexer for PDF / DOC files is already included.

Related

Is it possible to embed form inputs into a rich text editor document model?

I am building a project with Django 2.2.4 and PostgreSQL 11.4. I am using JSON database fields to store data in JSON arrays.
In my app users create documents using "rich text editor" that provides standard text/image features already.
I also want to let the users who create a document drag and drop form inputs into its body, so that once the document is "published" other users can view it and add input values to these fields before submitting and saving the document again.
Now, I'm trying to figure out conceptually the most efficient way to approach this.
I thought the first step would be to use an abstracted rich text editor which separates the document structure from the HTML, e.g. CKEditor or Quill; if I was to serialise the document with form inputs included I could in theory store templates in one JSONField and inputs in another.
This list is a really useful overview of various editors, but despite having read a lot of documentation it's not clear if this approach would be either correct or actually possible.
Does anyone have any similar experiences?
Pretty sure this isn't possible unfortunately.
Could you elaborate on what you're trying to do with this?

SharePoint 2013 PowerShell - Moving Document Library to New Site Collection

I am trying to move document libraries from one site collection to another and retain all of the metadata, such as Modified By, Date Modified, etc.
I have tried using the 'move' function within SharePoint 2013 Content and Structure, but unfortunately this doesn't extend across site collections.
I have tried multiple PowerShell scripts. They successfully copy the files from one site collection to another but do not retain the metadata.
Any ideas would be greatly appreciated.
Thanks.
Have you tried saving the library as a template? You can mark the template to keep the contents.
Then you can deploy a new library from that template (Site actions / More options / Template name), and after that delete the template.
Additionally you can move it between servers. Go to the Site settings, there to the template gallery and download the template you just created. Now you can move it to another server or application.
Hope it helps.
Take a look at the SharePoint migration tools. Some of them offer free trials for limited amounts of data.

Publish Word documents with Sphinx

My project is transitioning its documentation from Word documents to Sphinx. An intranet page with the Sphinx output is already up and running.
What we'd like to do is have Sphinx also copy Word documents to the intranet page, and reference those with hyperlinks from the rst files. Preferably in such a way that the links are still readable in plaintext (for example, a reference to ./file.doc is perfectly acceptable).
The documents don't need to be indexed nor converted nor generated - just published.
Is there any way to get Sphinx to do this, or do I need to build some scripts manually?
The download text role can be used to do what you want. From the following markup, Sphinx will copy Test.docx to the _downloads subdirectory of the output directory and generate a link:
See :download:`this Word document <../Test.docx>`.

How to Export Moodle Course format to DSpace?

I need to export courses from Moodle, but as it is a very closed application and the courses are in Moodle format, is there any way to extract the contents / metadata from that format to facilitate the migration to DSpace?
I know it is possible to do this by hand, but it would take a lot of time, since DSpace and Moodle use very different and complex databases.
Moodle exports courses with a .mbz extension. Simply rename it to .zip and you can extract the XML files from inside. These files will have all the information you need. You could potentially create a tool that programmatically extracts this information and imports it to DSpace.
Also, Moodle is open source, not a closed application. Source available here: https://github.com/moodle/moodle
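As a rough illustration of the "extract the XML files" step, here is a minimal Python sketch. It assumes the backup is either a zip archive, as described above, or a gzip-compressed tar archive, which more recent Moodle versions may produce. The function name `extract_mbz` is made up for this example, and mapping the extracted XML to DSpace metadata is not covered.

```python
import tarfile
import zipfile
from pathlib import Path


def extract_mbz(mbz_path, dest_dir):
    """Unpack a Moodle .mbz backup and list the XML files it contains.

    Hypothetical helper: only use it on backups you trust, because
    extractall() writes whatever paths the archive contains.
    """
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)

    if zipfile.is_zipfile(mbz_path):
        # Zip-based backup: no renaming needed, the extension is ignored.
        with zipfile.ZipFile(mbz_path) as archive:
            archive.extractall(dest)
    elif tarfile.is_tarfile(mbz_path):
        # Assumption: some backups are tar archives, possibly gzip-compressed.
        with tarfile.open(mbz_path) as archive:
            archive.extractall(dest)
    else:
        raise ValueError(f"{mbz_path} is neither a zip nor a tar archive")

    # Return paths relative to the destination, e.g. "moodle_backup.xml".
    return sorted(p.relative_to(dest).as_posix() for p in dest.rglob("*.xml"))
```

Calling `extract_mbz("course.mbz", "unpacked")` returns the relative paths of the XML files, which can then be parsed with `xml.etree.ElementTree` to pull out whatever metadata the DSpace import needs.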

Search file content on iPhone

Is there an iPhone SDK API to search resource files?
I have a set of html file resources that I'd like the user to be able to search in, but I want to avoid reading the files into memory and searching them one by one.
Is there any API that can help me do this?
No, you'll have to read the files in to search them. There's nothing like "Spotlight" on the phone.
I would suggest using the SQLite FTS3 extension for full-text search through your documents. It is quite capable. You will need to strip the HTML tags first, though, so that it indexes only your text and not the tags.
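To make the idea concrete, here is a small sketch of the strip-then-index approach, written in Python only because it is compact. On the phone the same SQL statements would go through the SQLite C API. The table name `pages`, its columns, and the helper functions are all invented for this example, and it assumes the SQLite build in use has FTS3 enabled.

```python
import sqlite3
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects text nodes; drops tags and script/style content."""

    def __init__(self):
        super().__init__()
        self._chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth:
            self._chunks.append(data)

    def text(self):
        # Join the text nodes and collapse runs of whitespace.
        return " ".join(" ".join(self._chunks).split())


def strip_html(html):
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return parser.text()


def build_index(conn, pages):
    """Index a mapping of file name -> HTML source in an FTS3 table."""
    conn.execute("CREATE VIRTUAL TABLE IF NOT EXISTS pages USING fts3(name, body)")
    conn.executemany(
        "INSERT INTO pages (name, body) VALUES (?, ?)",
        [(name, strip_html(html)) for name, html in pages.items()],
    )
    conn.commit()


def search(conn, query):
    """Return the names of pages whose stripped text matches the query."""
    rows = conn.execute(
        "SELECT name FROM pages WHERE body MATCH ? ORDER BY name", (query,)
    )
    return [name for (name,) in rows]
```

The index only has to be built once (it could even be shipped prebuilt with the app), after which `search(conn, "setup")` queries the index instead of reading each HTML file into memory.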