Pig Latin
E829914
Pig Latin is a high-level data flow language used with Apache Pig to analyze and process large datasets in Hadoop-based big data environments.
Statements (51)
| Predicate | Object |
|---|---|
| instanceOf | Apache Pig component; data flow language; high-level programming language |
| abstractionLevel | high-level |
| advantage | reduces amount of MapReduce code; simplifies complex data pipelines |
| comparedTo | SQL |
| compiledBy | Pig interpreter |
| designedFor | data processing; large-scale data analysis |
| developedFor | Yahoo! |
| differenceFromSQL | procedural rather than declarative |
| executesAs | MapReduce jobs |
| hasExtensionMechanism | UDFs in Java; UDFs in Python (Jython); UDFs in other JVM languages |
| hasFeature | automatic optimization; bag data type; data type system; lazy evaluation; map data type; nested data structures; relation abstraction; schema support; tuple data type |
| integratesWith | HBase; HDFS; Hive; local file system |
| paradigm | data flow; procedural |
| runsOn | Hadoop |
| supports | ETL workflows; data aggregation; data transformation; filtering operations; grouping; joins; sorting; user-defined functions |
| supportsExecutionMode | MapReduce mode; Spark mode; Tez mode; local mode |
| typicalUseCase | clickstream analysis; data cleansing; log processing |
| usedBy | data analysts; data engineers |
| usedIn | big data environments |
| usedWith | Apache Pig |
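
The procedural, step-by-step data flow style listed above (filtering, grouping, aggregation, sorting over tuples and bags) can be illustrated with a small plain-Python analogy. This is only a sketch under stated assumptions: the sample records, field layout, and function names are hypothetical, the code does not use any Apache Pig API, and it runs eagerly, whereas Pig evaluates lazily and only executes a plan when output is requested. The comments show what a roughly equivalent Pig Latin script might look like.

```python
from collections import defaultdict

# Roughly equivalent Pig Latin (illustrative only; path and schema are assumed):
#   logs    = LOAD 'logs' AS (user:chararray, url:chararray, status:int);
#   ok      = FILTER logs BY status == 200;
#   by_user = GROUP ok BY user;
#   counts  = FOREACH by_user GENERATE group AS user, COUNT(ok) AS hits;
#   ranked  = ORDER counts BY hits DESC;

# Hypothetical sample records; each Python tuple stands in for a Pig tuple,
# and the list as a whole stands in for a relation.
LOGS = [
    ("alice", "/home", 200),
    ("bob", "/home", 404),
    ("alice", "/cart", 200),
    ("carol", "/home", 200),
    ("bob", "/cart", 200),
    ("alice", "/pay", 500),
]


def filter_ok(records):
    """FILTER step: keep only tuples whose status field is 200."""
    return [r for r in records if r[2] == 200]


def group_by_user(records):
    """GROUP step: map each user to a bag (here a list) of that user's tuples."""
    groups = defaultdict(list)
    for r in records:
        groups[r[0]].append(r)
    return dict(groups)


def count_hits(groups):
    """FOREACH ... GENERATE step: emit one (user, hits) tuple per group."""
    return [(user, len(bag)) for user, bag in groups.items()]


def order_by_hits(counts):
    """ORDER step: sort by hits descending, breaking ties by user name."""
    return sorted(counts, key=lambda t: (-t[1], t[0]))


def run_pipeline(records):
    """Chain the steps in order, each consuming the previous step's output."""
    ok = filter_ok(records)
    by_user = group_by_user(ok)
    counts = count_hits(by_user)
    return order_by_hits(counts)


if __name__ == "__main__":
    print(run_pipeline(LOGS))
```

Each named intermediate result mirrors a Pig Latin alias, which is the main contrast with SQL noted in the table: the author spells out the sequence of transformations rather than declaring the desired result in one statement.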
Referenced by (1)
Full triples — surface form annotated when it differs from this entity's canonical label.