SchemaGen

E457344

SchemaGen is a TensorFlow Extended (TFX) component that automatically infers and generates data schemas by analyzing example datasets for use in machine learning pipelines.


Statements (50)

Predicate Object
instanceOf TFX component
machine learning tooling component
software library
analyzes example datasets
canUse schema inference heuristics
user-provided schema overrides
configurable True
developedBy Google
documentation https://www.tensorflow.org/tfx/guide/schemagen
domain ML pipelines
data engineering
machine learning
implements automatic schema inference
data schema generation
input TFX Example artifacts
statistics from StatisticsGen
integratesWith Apache Airflow
Apache Beam
Kubeflow Pipelines
ML Metadata
TFX orchestration systems
license Apache License 2.0
output TensorFlow Metadata schema
schema artifact
outputFormat TFMD Schema proto
partOf TFX pipeline
TensorFlow Extended
programmingLanguage Python
purpose generate data schemas for machine learning pipelines
support data validation
support feature engineering
support model training
repository https://github.com/tensorflow/tfx
supports boolean feature detection
categorical feature detection
domain inference
feature type inference
numeric feature detection
presence constraints inference
schema constraints specification
shape inference
sparse feature detection
string feature detection
usedWith ExampleGen
ExampleValidator
StatisticsGen
TFX pipeline orchestration
TensorFlow
Trainer
Transform
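
The "supports" statements above (feature type inference, presence constraints inference, domain inference) can be illustrated with a small, self-contained sketch. This is not SchemaGen's or TensorFlow Data Validation's actual algorithm: the function name `infer_schema`, the `max_domain_size` threshold, and the dictionary-based schema layout are all hypothetical choices made for illustration, and the real component emits a TFMD Schema proto rather than a Python dict.

```python
from collections import defaultdict


def infer_schema(examples, max_domain_size=10):
    """Infer a toy schema from a list of dict examples (illustrative only).

    Hypothetical helper: it mimics the *idea* of schema inference, not the
    heuristics that SchemaGen actually applies.
    """
    # Collect the observed (non-missing) values of every feature.
    values_by_feature = defaultdict(list)
    for example in examples:
        for name, value in example.items():
            if value is not None:
                values_by_feature[name].append(value)

    schema = {}
    for name, values in values_by_feature.items():
        # Feature type inference. bool is a subclass of int in Python,
        # so booleans are checked first and excluded from the numeric tests.
        if all(isinstance(v, bool) for v in values):
            ftype = "BOOL"
        elif all(isinstance(v, int) and not isinstance(v, bool) for v in values):
            ftype = "INT"
        elif all(
            isinstance(v, (int, float)) and not isinstance(v, bool) for v in values
        ):
            ftype = "FLOAT"
        else:
            ftype = "BYTES"

        # Presence constraint: required only if seen in every example.
        entry = {"type": ftype, "required": len(values) == len(examples)}

        # Domain inference: keep the value set for low-cardinality strings.
        if ftype == "BYTES":
            distinct = sorted({str(v) for v in values})
            if len(distinct) <= max_domain_size:
                entry["domain"] = distinct

        schema[name] = entry
    return schema


examples = [
    {"age": 34, "income": 52000.0, "country": "DE", "active": True},
    {"age": 27, "income": 41000, "country": "US", "active": False},
    {"age": 45, "country": "US", "active": True},
]
schema = infer_schema(examples)
for feature, spec in schema.items():
    print(feature, spec)
```

In this sketch `income` is inferred as FLOAT and marked optional because it is missing from one example, while `country` receives a two-value domain. A user-provided schema override, as listed under canUse, would correspond to editing the resulting entries by hand before downstream components consume them.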

Referenced by (1)

Full triples — surface form annotated when it differs from this entity's canonical label.