Spooling Directory Source
E702196
Spooling Directory Source is an Apache Flume component that reliably ingests files by monitoring a designated directory and processing new files as they appear.
Statements (47)
| Predicate | Object |
|---|---|
| instanceOf |
Apache Flume source
ⓘ
software component ⓘ |
| avoids | reprocessing of completed files ⓘ |
| category |
data ingestion
ⓘ
log collection ⓘ |
| configurationProperty |
basenameHeader
ⓘ
basenameHeaderKey ⓘ batchSize ⓘ bufferMaxLineLength ⓘ bufferMaxLines ⓘ consumeOrder ⓘ decodeErrorPolicy ⓘ deletePolicy ⓘ fileHeader ⓘ fileHeaderKey ⓘ fileSuffix ⓘ ignorePattern ⓘ inputCharset ⓘ maxBackoff ⓘ spoolDir ⓘ trackerDir ⓘ |
| deletePolicyOption |
immediate
ⓘ
never ⓘ |
| deletePolicyOption | suffix ⓘ |
| designedFor | reliable ingestion of files appearing in a directory ⓘ |
| documentation | Apache Flume User Guide NERFINISHED ⓘ |
| implements | reliable file ingestion ⓘ |
| input | files in spool directory ⓘ |
| language | Java NERFINISHED ⓘ |
| monitors | designated directory ⓘ |
| output | Flume events ⓘ |
| partOf | Apache Flume NERFINISHED ⓘ |
| processingModel |
directory spooling
ⓘ
file-based ingestion ⓘ |
| processingOrderOption |
oldest
ⓘ
random ⓘ youngest ⓘ |
| reliabilityFeature |
exactly-once file ingestion semantics (under documented constraints)
ⓘ
no re-ingestion of completed files ⓘ tracks file processing state ⓘ |
| softwareFramework | Apache Flume NERFINISHED ⓘ |
| supports |
line-oriented file processing
ⓘ
log file ingestion ⓘ text file ingestion ⓘ |
| tracksStateIn | tracker directory ⓘ |
| uses | Flume channel ⓘ |
| watchesFor | new files ⓘ |
Referenced by (1)
Full triples — surface form annotated when it differs from this entity's canonical label.