Suvarna Garge (Editor)

Xapian

Developer(s): xapian.org
Operating system: Cross-platform
Written in: C++
Type: Search and index API
Stable release: 1.4.2 / December 26, 2016
License: GNU General Public License

Xapian is an open-source probabilistic information retrieval library, released under the GNU General Public License (GPL). It is a full-text search engine library for programmers.

It is written in C++, with bindings to allow use from Perl, Python, PHP, Java, Tcl, C#, Ruby, and Lua. Xapian is highly portable and runs on Linux, OS X, FreeBSD, NetBSD, OpenBSD, Solaris, HP-UX, Tru64, IRIX, Windows, Hurd, and OS/2.

Xapian allows developers to add advanced indexing and search facilities to their own applications. Organisations and projects using Xapian include Debian, Gmane, Die Zeit, Delicious, MoinMoin, and One Laptop per Child.

Features

  • Transactions: if a database update fails in the middle of a transaction, the database is guaranteed to remain in a consistent state.
  • Simultaneous search and update, with new documents being immediately visible.
  • Support for large databases: Xapian has been proven to scale to hundreds of millions of documents.
  • Accurate probabilistic ranking: more relevant documents are listed first.
  • Phrase and proximity searching.
  • Relevance feedback, which improves ranking and can be used to expand a query, find related documents, categorise documents, etc.
  • Structured Boolean queries, e.g. "race AND condition NOT horse".
  • Wildcard search, e.g. "wiki*".
  • Spelling correction.
  • Synonyms.
  • Omega, a packaged solution for adding a search engine to a web site or intranet. Omega can easily be extended and adapted to fit changing requirements.
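
Three of the features listed above (Boolean queries, wildcard search, and probabilistic ranking) can be illustrated with a small pure-Python sketch. It does not use the Xapian API: the corpus, the function names, and the parameters k1 and b are all hypothetical, and the scoring function is the textbook BM25 formula, which may differ in detail from the weighting Xapian actually applies.

```python
import math
from collections import Counter, defaultdict

# Hypothetical toy corpus; ids and texts are made up for illustration.
DOCS = {
    1: "race condition in the thread scheduler",
    2: "horse race results and race condition of the track",
    3: "wiki markup and wikis for documentation",
    4: "debugging a race condition with a lock",
}

def build_index(docs):
    """Build an inverted index: term -> {doc_id: term frequency}."""
    index = defaultdict(dict)
    for doc_id, text in docs.items():
        for term, tf in Counter(text.lower().split()).items():
            index[term][doc_id] = tf
    return index

def postings(index, term):
    """Return the set of ids of documents containing the term."""
    return set(index.get(term, {}))

def wildcard(index, prefix):
    """Right-truncated wildcard: union of postings of terms with this prefix."""
    matches = set()
    for term in index:
        if term.startswith(prefix):
            matches |= postings(index, term)
    return matches

def bm25(index, docs, query_terms, k1=1.2, b=0.75):
    """Score documents with textbook BM25; returns [(doc_id, score)], best first."""
    n = len(docs)
    lengths = {d: len(t.split()) for d, t in docs.items()}
    avg_len = sum(lengths.values()) / n
    scores = defaultdict(float)
    for term in query_terms:
        plist = index.get(term, {})
        # Rarer terms get a larger inverse document frequency.
        idf = math.log(1 + (n - len(plist) + 0.5) / (len(plist) + 0.5))
        for doc_id, tf in plist.items():
            # Longer-than-average documents are penalised via the b term.
            norm = tf + k1 * (1 - b + b * lengths[doc_id] / avg_len)
            scores[doc_id] += idf * tf * (k1 + 1) / norm
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)

index = build_index(DOCS)

# "race AND condition NOT horse" expressed as set operations on postings.
boolean_hits = (postings(index, "race") & postings(index, "condition")) - postings(index, "horse")

# "wiki*" matches both "wiki" and "wikis".
wiki_hits = wildcard(index, "wiki")

# Ranked retrieval for the free-text query "race condition".
ranked = bm25(index, DOCS, ["race", "condition"])
```

Here the Boolean query selects documents 1 and 4, the wildcard query selects document 3, and the ranked list contains every document that mentions either query term, ordered by score. A real search library adds stemming, positional data for phrase search, and on-disk storage, none of which this sketch attempts.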