Apache Hive
E185675
Apache Hive is a data warehouse and SQL-like query system built on top of Hadoop for managing and analyzing large datasets stored in distributed storage.
All labels observed (2)
| Label | Occurrences |
|---|---|
| Apache Hive (canonical) | 11 |
| Hive | 2 |
Statements (62)
| Predicate | Object |
|---|---|
| instanceOf | SQL query engine, data warehouse software, open-source software |
| developer | Apache Software Foundation, Facebook |
| domain | big data analytics, distributed data processing |
| donatedTo | Apache Software Foundation |
| donationYear | 2008 |
| feature | ACID transactions, JDBC driver, ODBC driver, bucketing, cost-based optimizer, materialized views, metastore, partitioning, schema-on-read, user-defined functions, vectorized query execution |
| inception | 2007 |
| introducedBy | Facebook Data Infrastructure Team |
| license | Apache License 2.0 |
| operatingSystem | cross-platform |
| partOf | Hadoop (surface form: Apache Hadoop ecosystem) |
| primaryUse | ETL processing, batch processing of large datasets, data warehousing |
| programmingLanguage | C++, Java, Python |
| repository | https://github.com/apache/hive |
| runsOn | Apache Spark, Apache Tez, YARN (surface form: Hadoop YARN) |
| supportsConcept | UDAF, UDF, UDTF, external tables, indexes, managed tables, views |
| supportsFileFormat | Avro, JSON, ORC, Parquet, RCFile, SequenceFile, Text |
| supportsLanguage | HiveQL, SQL-like query language |
| supportsPlatform | Amazon S3, Azure Data Lake Storage, Google Cloud Storage, HDFS (surface form: Hadoop Distributed File System) |
| topLevelProjectOf | Apache Software Foundation |
| usesComponent | Beeline, CLI shell, Hive Metastore, HiveServer2 |
| website | https://hive.apache.org/ |
| writtenIn | Java |
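
Several of the statements above (partitioning, bucketing, external and managed tables, and file formats such as ORC and Parquet) come together in a single HiveQL `CREATE TABLE` statement. The following is a minimal sketch in Python that only assembles the DDL text and does not connect to Hive. The helper `build_create_table`, the table name `page_views`, its columns, and the HDFS path are all hypothetical and chosen for illustration. Treating an explicit location as the signal for an external table is an assumption of this sketch, not a Hive rule.

```python
def build_create_table(name, columns, partition_cols, bucket_col,
                       num_buckets, file_format="ORC", location=None):
    """Return a HiveQL CREATE TABLE statement as a string (hypothetical helper)."""
    col_defs = ", ".join(f"{col} {typ}" for col, typ in columns)
    part_defs = ", ".join(f"{col} {typ}" for col, typ in partition_cols)
    # Assumption of this sketch: passing a location means "external table";
    # without one, the statement creates a managed table.
    kind = "EXTERNAL TABLE" if location else "TABLE"
    clauses = [
        f"CREATE {kind} {name} ({col_defs})",
        # Partition columns are declared separately from the data columns.
        f"PARTITIONED BY ({part_defs})",
        # Bucketing hashes rows on one column into a fixed number of buckets.
        f"CLUSTERED BY ({bucket_col}) INTO {num_buckets} BUCKETS",
        f"STORED AS {file_format}",
    ]
    if location:
        clauses.append(f"LOCATION '{location}'")
    return "\n".join(clauses)


ddl = build_create_table(
    name="page_views",                      # illustrative table name
    columns=[("user_id", "BIGINT"), ("url", "STRING")],
    partition_cols=[("view_date", "STRING")],
    bucket_col="user_id",
    num_buckets=32,
    location="hdfs:///warehouse/page_views",  # illustrative path
)
print(ddl)
```

The resulting string could be submitted through any Hive client, for example Beeline connected to HiveServer2. The sketch does no identifier quoting or validation, so it is not suitable for untrusted input.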
Referenced by (13)
Full triples — surface form annotated when it differs from this entity's canonical label.
this entity surface form:
Hive