XML Search Engine - SkXml-J
SkXml-J - a powerful and flexible search engine for checking any XML data
- Do you have to search and update large XML-based datasets, especially as part of your daily, weekly or monthly time-sensitive tasks?
- Do you find that you need more flexibility to check both structure and content, even when you are still building up your data?
- Are you missing business opportunities because you can't guarantee your data quality?
Get back in control of your data with IDM's SkXml-J search & retrieval engine
SkXml-J is the core module for documentary and editorial work: it explores and extracts valuable information from large collections of semi-structured data. SkXml-J delivers search features that give users the best of both worlds: the ability to search on XML structure and, at the same time, full-text search features that can either be combined with the XML structure or ignore it. You can strengthen the accuracy and consistency of your searches by exploiting the XML structure, and you can also mix in all the classical full-text operators (boolean, proximity, regular expressions, case and accent sensitivity, etc.).
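To make the idea of mixing structural and full-text search concrete, here is a minimal sketch in Python using only the standard library. It is not SkXml-J's query language, which this page does not show; the sample data, the element names (entry, hw, def) and the helper functions flat_text and search are all hypothetical.

```python
import re
import xml.etree.ElementTree as ET

# Invented sample data; the element names are assumptions for illustration.
SAMPLE = """<entries>
  <entry id="e1"><hw>café</hw><def>A small <i>coffee</i> house.</def></entry>
  <entry id="e2"><hw>tea</hw><def>A drink; see also coffee.</def></entry>
  <entry id="e3"><hw>coffee</hw><def>A dark roasted drink.</def></entry>
</entries>"""


def flat_text(element):
    """Return the element's text with all nested tagging ignored."""
    return " ".join("".join(element.itertext()).split())


def search(root, tag, pattern, flags=re.IGNORECASE):
    """Yield (entry id, text) for each element whose text matches pattern.

    tag restricts the search to one element name; None means any element.
    """
    regex = re.compile(pattern, flags)
    for entry in root.iter("entry"):
        for node in entry.iter(tag):
            text = flat_text(node)
            if regex.search(text):
                yield entry.get("id"), text


root = ET.fromstring(SAMPLE)

# Structure-aware: the word "coffee" only inside <def> elements.
in_definitions = list(search(root, "def", r"\bcoffee\b"))

# Structure-agnostic: the word "coffee" in any element of an entry.
anywhere = sorted({entry_id for entry_id, _ in search(root, None, r"\bcoffee\b")})

# Phrase search that ignores the <i> tagging inside the definition.
phrase = list(search(root, "def", r"small coffee house"))
```

In this sketch the structure-aware query finds entries e1 and e2, while the structure-agnostic one also finds e3, whose headword is "coffee". Note that flat_text glues the text of adjacent elements together without a separator, which a real engine would handle more carefully.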
Features
The features of the SkXml-J search engine are:
- Global features
  - full XML and Unicode support
  - boolean operators
  - ability to fetch various items of a document as an array of results
- XML operators
  - tag containing something
  - attribute containing something
  - ability to ignore tagging in a text phrase
- Word operators
  - exact phrase
  - inflected forms
  - regular expression
  - case sensitivity, accent sensitivity
  - proximity
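As an illustration of the word operators listed above, the following Python sketch shows one possible way to implement case and accent sensitivity together with a proximity test. It is a sketch under assumptions, not SkXml-J's implementation: the functions fold, tokens and near are hypothetical, and proximity is assumed here to mean the difference between word positions.

```python
import re
import unicodedata


def fold(text, case_sensitive=False, accent_sensitive=False):
    """Normalize text according to the requested sensitivity options."""
    if accent_sensitive:
        # Keep accents, but use one canonical form for comparison.
        text = unicodedata.normalize("NFC", text)
    else:
        # Decompose accented letters, then drop the combining marks.
        decomposed = unicodedata.normalize("NFD", text)
        text = "".join(c for c in decomposed if not unicodedata.combining(c))
    if not case_sensitive:
        text = text.casefold()
    return text


def tokens(text, **options):
    """Split folded text into words."""
    return re.findall(r"\w+", fold(text, **options))


def near(text, first, second, max_gap, **options):
    """True if first and second occur at most max_gap word positions apart."""
    words = tokens(text, **options)
    a = fold(first, **options)
    b = fold(second, **options)
    positions_a = [i for i, word in enumerate(words) if word == a]
    positions_b = [i for i, word in enumerate(words) if word == b]
    return any(abs(i - j) <= max_gap for i in positions_a for j in positions_b)


SENTENCE = "Le Café est près de la gare"
loose = near(SENTENCE, "cafe", "gare", 5)
too_far = near(SENTENCE, "cafe", "gare", 4)
strict = near(SENTENCE, "cafe", "gare", 5, accent_sensitive=True)
```

With the default options, "cafe" matches "Café" and lies five word positions from "gare", so loose is true and too_far is false. With accent_sensitive=True the unaccented query no longer matches, so strict is false.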
Input data: full XML and Unicode support
SkXml-J can index any set of XML documents through its own SkProd data processing chain. For instance, it can work through a directory of well-formed XML files, which do not need to share the same DTD. In fact, SkXml-J does not require a DTD at all. Oxford University Press runs SkProd every night to index the 1.3 GB of Oxford English Dictionary entries. Indexing completes within three and a half hours, so that 50 users can work with the updated content the following day.
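The idea of indexing a directory of heterogeneous, DTD-less XML files can be sketched as follows in Python. This is not the SkProd chain, whose internals are not described here: the function index_directory, the sample files and the simple word-to-location index are all assumptions made for illustration.

```python
import re
import tempfile
import xml.etree.ElementTree as ET
from collections import defaultdict
from pathlib import Path


def index_directory(directory):
    """Map each lowercased word to the (file name, tag) pairs containing it."""
    index = defaultdict(set)
    skipped = []  # files that are not well-formed XML
    for path in sorted(Path(directory).glob("*.xml")):
        try:
            root = ET.parse(path).getroot()  # no DTD or schema is required
        except ET.ParseError:
            skipped.append(path.name)
            continue
        for element in root.iter():
            # Text directly inside this element: its leading text plus the
            # tail text that follows each of its children.
            pieces = [element.text or ""]
            pieces += [child.tail or "" for child in element]
            for word in re.findall(r"\w+", " ".join(pieces).lower()):
                index[word].add((path.name, element.tag))
    return index, skipped


# Hypothetical sample files with unrelated vocabularies and no DTD.
SAMPLES = {
    "a.xml": "<entry><hw>tea</hw><def>A hot drink</def></entry>",
    "b.xml": '<note lang="en">Tea time</note>',
    "c.xml": "<open>",  # not well-formed, so it is skipped
}

with tempfile.TemporaryDirectory() as folder:
    for name, content in SAMPLES.items():
        Path(folder, name).write_text(content, encoding="utf-8")
    index, skipped = index_directory(folder)

tea_hits = sorted(index["tea"])
```

Here the two well-formed files use different element names, yet both end up in the same index, and the malformed file is reported rather than stopping the run.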
Results list in tabular form
SkXml-J returns a list of results formatted as a table. Through the query language, users define the number and nature of the columns so that the table displays the segments of the data they are interested in. They can then manipulate the results list by sorting and filtering the table.
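A tabular results list with user-defined columns might look like the following Python sketch. It does not reproduce SkXml-J's query language: the function results_table, the column notation (a path, or "@name" for an attribute) and the sample data, including the freq values, are invented for illustration.

```python
import xml.etree.ElementTree as ET

# Invented sample data; element names and values are assumptions.
SAMPLE = """<entries>
  <entry id="e1"><hw>tea</hw><pos>noun</pos><freq>12</freq></entry>
  <entry id="e2"><hw>brew</hw><pos>verb</pos><freq>7</freq></entry>
  <entry id="e3"><hw>café</hw><pos>noun</pos><freq>30</freq></entry>
</entries>"""


def results_table(root, row_path, columns):
    """Build one row (a dict) per element matching row_path.

    columns maps a column name to a child path, or to "@name" for an
    attribute of the row element. Missing values become empty strings.
    """
    table = []
    for node in root.iterfind(row_path):
        row = {}
        for name, path in columns.items():
            if path.startswith("@"):
                row[name] = node.get(path[1:], "")
            else:
                row[name] = node.findtext(path, default="")
        table.append(row)
    return table


root = ET.fromstring(SAMPLE)
table = results_table(
    root,
    "entry",
    {"id": "@id", "headword": "hw", "pos": "pos", "freq": "freq"},
)

# Filter the table, then sort the remaining rows on a numeric column.
nouns = [row for row in table if row["pos"] == "noun"]
by_freq = sorted(nouns, key=lambda row: int(row["freq"]), reverse=True)
headwords = [row["headword"] for row in by_freq]
```

The caller chooses both which columns appear and in what order, and sorting and filtering then operate on the rows without going back to the XML.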

