TReSy is an XML search engine oriented to text retrieval.
At the beginning TReSy project was born to satisfy the needs of CRIBeCu connected to the activity of digital library in the field of Cultural Heritage.
Now it is a project in progress, in fact it is oriented to innovative research elements connected with information processing field and with software applied to cultural heritage. In order to describe TReSy it is useful to contrast the kind of results it gives with the result of other systems.
Many XML engines return XML elements (nodes), other engines (like Google) refine and return a subset of documents starting from a big collection of documents. On the opposite, TReSy returns a reference to the text occurrence.
Among the software packages which are oriented to text retrieval from XML documents, TReSy is particularly well suited for developing software solutions for digital libraries. The TReSy features are oriented to full-text pattern search over big amounts of documents, and it allows to easily combine each element of the result with the local XML information.
TReSy is applied to write interface software to deal with XML documents collections; the features of this interface are:
- management of huge amount of documents
- full text search, designed to retrieve with the same efficiency Words, portion of word, word beginnings, word ends (String-BTree)
- visualization of the occurrences resulting from a query in the context of original documents
- use of XML mark-up to refine result and to build frequency lists
- high query performance, which is preserved even when the amount of data grows up
Contact: Michele Paoli