Skip to main content
Ctrl+K
PyFlink 1.20+vvr.11.7.dev0 documentation - Home PyFlink 1.20+vvr.11.7.dev0 documentation - Home
  • API Reference
  • Examples
  • API Reference
  • Examples

Section Navigation

  • PyFlink Table
  • PyFlink DataStream
    • StreamExecutionEnvironment
    • DataStream
    • Functions
    • State
    • Timer
    • Window
    • Checkpoint
    • Side Outputs
    • Asynchronous I/O
    • Connectors
    • Formats
  • PyFlink DataFrame
  • PyFlink Common
  • API Reference
  • PyFlink DataStream
  • DataStream
  • pyflink.datastream.data_stream.AllWindowedStream.side_output_late_data

pyflink.datastream.data_stream.AllWindowedStream.side_output_late_data#

AllWindowedStream.side_output_late_data(output_tag: OutputTag) → AllWindowedStream[source]#

Send late arriving data to the side output identified by the given OutputTag. Data is considered late after the watermark has passed the end of the window plus the allowed lateness set using allowed_lateness().

You can get the stream of late data using get_side_output() on the DataStream resulting from the windowed operation with the same OutputTag.

Example:

>>> tag = OutputTag("late-data", Types.TUPLE([Types.INT(), Types.STRING()]))
>>> main_stream = ds.window_all(TumblingEventTimeWindows.of(Time.seconds(5))) \
...                 .side_output_late_data(tag) \
...                 .process(MyProcessAllWindowFunction(),
...                          Types.TUPLE([Types.LONG(), Types.LONG(), Types.INT()]))
>>> late_stream = main_stream.get_side_output(tag)

previous

pyflink.datastream.data_stream.AllWindowedStream.allowed_lateness

next

pyflink.datastream.data_stream.AllWindowedStream.reduce

On this page
  • AllWindowedStream.side_output_late_data()

This Page

  • Show Source

Created using Sphinx 7.4.7.

Built with the PyData Sphinx Theme 0.16.1.