Apache Lucene

E358078

information retrieval library open-source software search engine library

Apache Lucene is a high-performance, full-featured text search engine library written in Java and widely used as the core indexing and search technology in many applications and search platforms.

Try in SPARQL Jump to: Surface forms Disambiguation Statements Elicitation Referenced by

All labels observed (3)

Label	Occurrences
Apache Lucene canonical	4
Apache Lucene project	1
Lucene index	1

How this entity was disambiguated

This entity first appeared as the object of triple T3418864 — resolving that mention is where its identity was fixed. The disambiguator weighed these candidate entities and picked the highlighted one (or “None”, minting a new entity). This is how homonymy is resolved: the same surface form can point to different entities.

NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07

Target entity: Apache Lucene
Context triple: [Apache Software Foundation, overseesProject, Apache Lucene]

A. Apache Mahout
Apache Mahout is an open-source machine learning library designed to build scalable algorithms for clustering, classification, and recommendation on large datasets, often leveraging big data platforms.
B. Apache HBase
Apache HBase is a distributed, scalable, NoSQL database designed for real-time read/write access to large datasets, typically running on top of the Hadoop ecosystem.
C. xVelocity in-memory analytics engine
xVelocity in-memory analytics engine is a columnar, in-memory data processing engine developed by Microsoft to enable fast, compressed, and scalable analytical querying for business intelligence tools.
D. Hadoop
Hadoop is an open-source framework that enables distributed storage and parallel processing of large data sets across clusters of commodity hardware.
E. Apache Hive
Apache Hive is a data warehouse and SQL-like query system built on top of Hadoop for managing and analyzing large datasets stored in distributed storage.
F. None of above. chosen
G. Unsure - the case is ambiguous/there is not enough information to decide.

NED2 Entity disambiguation (via description) gpt-5-mini-2025-08-07

Target entity: Apache Lucene
Target entity description: Apache Lucene is a high-performance, full-featured text search engine library written in Java and widely used as the core indexing and search technology in many applications and search platforms.

A. Apache Mahout
Apache Mahout is an open-source machine learning library designed to build scalable algorithms for clustering, classification, and recommendation on large datasets, often leveraging big data platforms.
B. Apache HBase
Apache HBase is a distributed, scalable, NoSQL database designed for real-time read/write access to large datasets, typically running on top of the Hadoop ecosystem.
C. xVelocity in-memory analytics engine
xVelocity in-memory analytics engine is a columnar, in-memory data processing engine developed by Microsoft to enable fast, compressed, and scalable analytical querying for business intelligence tools.
D. Hadoop
Hadoop is an open-source framework that enables distributed storage and parallel processing of large data sets across clusters of commodity hardware.
E. Apache Hive
Apache Hive is a data warehouse and SQL-like query system built on top of Hadoop for managing and analyzing large datasets stored in distributed storage.
F. None of above. chosen

Statements (52)

Predicate	Object
instanceOf	information retrieval library ⓘ open-source software ⓘ search engine library ⓘ
coreOf	Apache Solr ⓘ Elasticsearch ⓘ OpenSearch ⓘ
developer	Apache Software Foundation ⓘ
feature	Boolean queries ⓘ custom scoring ⓘ faceted search support ⓘ filtering ⓘ full-text search ⓘ fuzzy queries ⓘ highlighting ⓘ index compression ⓘ indexing ⓘ near real-time search ⓘ phrase queries ⓘ pluggable analyzers ⓘ range queries ⓘ ranking ⓘ scoring ⓘ segment-based index structure ⓘ sorting ⓘ stemming ⓘ tokenization ⓘ wildcard queries ⓘ
genre	full-text search ⓘ text indexing ⓘ
implements	inverted index ⓘ
influenced	Apache Solr ⓘ Elasticsearch ⓘ OpenSearch ⓘ
initialReleaseYear	early 2000s ⓘ
license	Apache License 2.0 ⓘ
operatingSystem	cross-platform ⓘ
organization	Apache Software Foundation ⓘ
originalAuthor	Doug Cutting ⓘ
partOf	Apache Lucene self-linksurface differs ⓘ surface form: Apache Lucene project
programmingLanguage	Java ⓘ
repository	https://github.com/apache/lucene ⓘ
supports	BM25 ranking algorithm ⓘ vector search (kNN) in recent versions ⓘ
supportsLanguage	English ⓘ multiple natural languages ⓘ
useCase	application search ⓘ content management systems ⓘ document management systems ⓘ enterprise search ⓘ log search ⓘ website search ⓘ
writtenIn	Java ⓘ

How these facts were elicited

Referenced by (6)

Full triples — surface form annotated when it differs from this entity's canonical label.

Apache Software Foundation → overseesProject → Apache Lucene ⓘ

ASF → governs → Apache Lucene ⓘ

subject surface form: Apache Software Foundation

Apache Lucene → partOf → Apache Lucene self-linksurface differs ⓘ

this entity surface form: Apache Lucene project

Apache Solr → basedOn → Apache Lucene ⓘ

Apache Solr → usesIndexEngine → Apache Lucene ⓘ

this entity surface form: Lucene index

ApacheCon → isRelatedTo → Apache Lucene ⓘ

All labels observed (3)

How this entity was disambiguated Show

Statements (52)

How these facts were elicited Show

Referenced by (6)

How this entity was disambiguated

How these facts were elicited