Skip to main content
Ctrl+K
PyFlink 1.20+vvr.11.7.dev0 documentation - Home PyFlink 1.20+vvr.11.7.dev0 documentation - Home
  • API Reference
  • Examples
  • API Reference
  • Examples

Section Navigation

  • PyFlink Table
  • PyFlink DataStream
    • StreamExecutionEnvironment
    • DataStream
    • Functions
    • State
    • Timer
    • Window
    • Checkpoint
    • Side Outputs
    • Asynchronous I/O
    • Connectors
    • Formats
  • PyFlink DataFrame
  • PyFlink Common
  • API Reference
  • PyFlink DataStream
  • Formats
  • pyflink.datastream.formats.csv.CsvReaderFormat

pyflink.datastream.formats.csv.CsvReaderFormat#

class CsvReaderFormat(j_csv_format)[source]#

The StreamFormat for reading csv files.

Example:

>>> schema = CsvSchema.builder() \
...     .add_number_column('id', number_type=DataTypes.INT()) \
...     .add_string_column('name') \
...     .add_array_column('list', ',', element_type=DataTypes.STRING()) \
...     .set_column_separator('|') \
...     .set_escape_char('\\') \
...     .set_use_header() \
...     .set_strict_headers() \
...     .build()
>>> source = FileSource.for_record_stream_format(
...     CsvReaderFormat.for_schema(schema), CSV_FILE_PATH).build()
>>> ds = env.from_source(source, WatermarkStrategy.no_watermarks(), 'csv-source')
>>> # the type of records is Types.ROW_NAMED(['id', 'name', 'list'],
>>> #   [Types.INT(), Types.STRING(), Types.LIST(Types.STRING())])

Added in version 1.16.0.

Methods

for_schema(schema)

Builds a CsvReaderFormat using CsvSchema.

text_line_format([charset_name])

Creates a reader format that text lines from a file.

previous

pyflink.datastream.formats.csv.CsvSchemaBuilder

next

pyflink.datastream.formats.csv.CsvBulkWriters

On this page
  • CsvReaderFormat

This Page

  • Show Source

Created using Sphinx 7.4.7.

Built with the PyData Sphinx Theme 0.16.1.