💾 Archived View for thebird.nl › gn-gemtext-threads › issues › full-text-search.gmi captured on 2023-05-24 at 17:54:36. Gemini links have been rewritten to link to archived content
⬅️ Previous capture (2023-03-20)
-=-=-=-=-=-=-
Implement full text search on the dumped RDF using xapian. This may be a feature to be implemented in the dump-genenetwork-database repo using guile-xapian.
dump-genenetwork-database repository
We also considered that virtuoso supports full text search in SPARQL. But, the query syntax is not as natural as xapian, and there may be no stemming. So, this option was rejected.
An initial full text search has been implemented. This initial implementation indexes data from the SQL database and provides an API to search that index. See the index-genenetwork script and the search API implementation in the genenetwork3 repository.
https://github.com/genenetwork/genenetwork3/blob/main/scripts/index-genenetwork
https://github.com/genenetwork/genenetwork3/blob/main/gn3/api/search.py
In the future, full text search must index RDF rather than data from SQL. Please open a new issue or reopen this issue when working on that.