SchemaGen
E457344
SchemaGen is a TensorFlow Extended (TFX) component that automatically infers and generates data schemas by analyzing example datasets for use in machine learning pipelines.
Statements (50)
| Predicate | Object |
|---|---|
| instanceOf | TFX component, machine learning tooling component, software library |
| analyzes | example datasets |
| canUse | schema inference heuristics, user-provided schema overrides |
| configurable | True |
| developedBy | Google |
| documentation | https://www.tensorflow.org/tfx/guide/schemagen |
| domain | ML pipelines, data engineering, machine learning |
| implements | automatic schema inference, data schema generation |
| input | TFX Example artifacts, statistics from StatisticsGen |
| integratesWith | Apache Airflow, Apache Beam, Kubeflow Pipelines, ML Metadata, TFX orchestration systems |
| license | Apache License 2.0 |
| output | TensorFlow Metadata schema, schema artifact |
| outputFormat | TFMD Schema proto |
| partOf | TFX pipeline, TensorFlow Extended |
| programmingLanguage | Python |
| purpose | generate data schemas for machine learning pipelines, support data validation, support feature engineering, support model training |
| repository | https://github.com/tensorflow/tfx |
| supports | boolean feature detection, categorical feature detection, domain inference, feature type inference, numeric feature detection, presence constraints inference, schema constraints specification, shape inference, sparse feature detection, string feature detection |
| usedWith | ExampleGen, ExampleValidator, StatisticsGen, TFX pipeline orchestration, TensorFlow, Trainer, Transform |
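
The capabilities listed under `supports` (feature type inference, categorical feature detection, presence constraints inference) can be illustrated with a small toy sketch. This is not SchemaGen's implementation and does not use the TFX or TensorFlow Metadata APIs: the function `infer_schema`, the parameter `max_categories`, the type labels, and the sample records are all hypothetical, chosen only to show the general idea of deriving a schema from observed examples.

```python
from typing import Any, Dict, List


def infer_schema(examples: List[Dict[str, Any]],
                 max_categories: int = 10) -> Dict[str, Dict[str, Any]]:
    """Toy schema inference over a list of dict records (illustrative only)."""
    schema: Dict[str, Dict[str, Any]] = {}
    feature_names = sorted({name for example in examples for name in example})
    for name in feature_names:
        # Collect the observed (non-missing) values for this feature.
        values = [ex[name] for ex in examples if ex.get(name) is not None]

        # Type inference. bool is checked first because bool is a subclass of int.
        if not values:
            feature_type = "unknown"
        elif all(isinstance(v, bool) for v in values):
            feature_type = "bool"
        elif all(isinstance(v, int) and not isinstance(v, bool) for v in values):
            feature_type = "int"
        elif all(isinstance(v, (int, float)) and not isinstance(v, bool)
                 for v in values):
            feature_type = "float"
        else:
            feature_type = "string"

        # Presence constraint: required only if every example has a value.
        feature: Dict[str, Any] = {
            "type": feature_type,
            "required": len(values) == len(examples),
        }

        # Domain inference: low-cardinality strings are treated as categorical.
        if feature_type == "string":
            distinct = sorted({str(v) for v in values})
            if len(distinct) <= max_categories:
                feature["domain"] = distinct

        schema[name] = feature
    return schema


# Hypothetical sample records; "income" is missing from the last one.
examples = [
    {"age": 34, "income": 52000.0, "country": "DE", "subscribed": True},
    {"age": 27, "income": 61000.5, "country": "US", "subscribed": False},
    {"age": 45, "country": "US", "subscribed": True},
]
schema = infer_schema(examples)
```

In this sketch `income` ends up optional because one record lacks it, and `country` receives a two-value domain. A real pipeline would instead obtain the schema as a TFMD Schema proto produced by SchemaGen from the StatisticsGen output.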