Flink + Bolt #20

lgbo-ustc · 2025-12-12T03:14:21Z

We have recently implemented a native execution engine for Flink based on Velox and have successfully run some Nexmark test cases. Currently, the system does not yet fully support stateful operators. Since progress on introducing new features in Velox has been relatively slow, we are considering whether Bolt could serve as an alternative implementation.

Flink execution follows a push model, while Velox execution is based on a pull model. During earlier development, we attempted to implement a push-like execution flow in Velox, but this would have required significant modifications to the framework. We are now exploring whether Flink’s computational characteristics can be met without changing the execution model of Velox/Bolt. After some reconsideration, we believe it is feasible. Below is the approach we have in mind, and we would like to hear feedback from the community.

We intend to treat Velox/Bolt as the internal computation implementation of a Flink operator. From this perspective, it makes no fundamental difference whether Velox/Bolt itself runs in pull or push mode: to the surrounding Flink operator, the two are equivalent. The internal execution flow within a Flink operator would be as follows:

public void processElement(StreamRecord<T> element) {
    /*
     * 1) Feed the input element into the source node of the Velox/Bolt execution plan
     * 2) Call getOutput to retrieve results from the Velox/Bolt execution plan
     * 3) Push the result to Flink’s output collector
     */
}
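To make the three steps concrete, here is a minimal, self-contained sketch of the push-over-pull wrapper. It deliberately avoids the real Flink and Velox/Bolt APIs: `NativePlan`, `MappingPlan`, and `PushOverPullOperator` are hypothetical names used only for illustration, and a plain `Consumer` stands in for Flink's output collector.

```java
import java.util.ArrayDeque;
import java.util.Queue;
import java.util.function.Consumer;
import java.util.function.Function;

/** Hypothetical stand-in for a pull-based native plan (not the Velox/Bolt API). */
interface NativePlan<I, O> {
    /** Enqueue one input batch at the plan's source node. */
    void addInput(I batch);

    /** Pull the next available result, or null if the plan needs more input. */
    O getOutput();
}

/** Toy plan that applies a function to each input; stands in for a fused native pipeline. */
final class MappingPlan<I, O> implements NativePlan<I, O> {
    private final Queue<I> pending = new ArrayDeque<>();
    private final Function<I, O> fn;

    MappingPlan(Function<I, O> fn) {
        this.fn = fn;
    }

    @Override
    public void addInput(I batch) {
        pending.add(batch);
    }

    @Override
    public O getOutput() {
        I next = pending.poll();
        return next == null ? null : fn.apply(next);
    }
}

/** Push-style wrapper: each pushed element is fed in, then the plan is drained. */
final class PushOverPullOperator<I, O> {
    private final NativePlan<I, O> plan;
    private final Consumer<O> collector; // stands in for Flink's output collector

    PushOverPullOperator(NativePlan<I, O> plan, Consumer<O> collector) {
        this.plan = plan;
        this.collector = collector;
    }

    void processElement(I element) {
        plan.addInput(element);                    // 1) feed the source node
        O out;
        while ((out = plan.getOutput()) != null) { // 2) pull until the plan is drained
            collector.accept(out);                 // 3) push the result downstream
        }
    }
}
```

Because the plan is drained synchronously inside `processElement`, a collector that blocks also blocks the caller, which is how this sketch keeps push-style backpressure intact.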

This implementation does not disrupt Flink’s push-based execution logic and preserves its backpressure control mechanism.

In Flink’s StreamGraphTranslator, we have modified the JobGraph. We still follow the principle that if two adjacent operators can both be offloaded to the native execution engine, they are merged into the same Velox/Bolt plan, forming a new operator. Currently, Velox/Bolt does not seem to support operators with multiple outputs; supporting them would probably require something similar to a local exchange queue. We have adopted a simpler approach instead: if an operator has multiple outputs, it is not merged with its downstream operators, and communication between them still goes through Flink channels.

The StreamGraphTranslator partitions an operator chain into different segments based on the following rules:

If two adjacent operators have different offloading capabilities to the native execution engine, they are placed in separate segments.

If an operator has multiple outputs, it is placed in a different segment from its downstream operators.

If an operator has multiple inputs, it is placed in a different segment from its upstream operators.
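The three rules above can be sketched as a single pass over a chain. This is only an illustration under simplifying assumptions: the chain is treated as a linear list, and `OpInfo` and `ChainSegmenter` are hypothetical names, not part of Flink's StreamGraphTranslator.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical operator descriptor; the field names are illustrative only. */
final class OpInfo {
    final String name;
    final boolean offloadable; // can this operator run in the native engine?
    final int inputs;
    final int outputs;

    OpInfo(String name, boolean offloadable, int inputs, int outputs) {
        this.name = name;
        this.offloadable = offloadable;
        this.inputs = inputs;
        this.outputs = outputs;
    }
}

final class ChainSegmenter {
    /** True if the chain must be cut between an upstream and a downstream operator. */
    static boolean mustSplit(OpInfo up, OpInfo down) {
        return up.offloadable != down.offloadable // rule 1: offloading capability differs
            || up.outputs > 1                     // rule 2: upstream has multiple outputs
            || down.inputs > 1;                   // rule 3: downstream has multiple inputs
    }

    /** Splits a linear chain into segments, returned as lists of operator names. */
    static List<List<String>> segment(List<OpInfo> chain) {
        List<List<String>> segments = new ArrayList<>();
        List<String> current = new ArrayList<>();
        OpInfo prev = null;
        for (OpInfo op : chain) {
            if (prev != null && mustSplit(prev, op)) {
                segments.add(current);
                current = new ArrayList<>();
            }
            current.add(op.name);
            prev = op;
        }
        if (!current.isEmpty()) {
            segments.add(current);
        }
        return segments;
    }
}
```

In this sketch, each resulting segment whose operators are all offloadable would correspond to one merged Velox/Bolt plan, while the cuts between segments remain ordinary Flink channels.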

Changes to an operator chain before and after the modification are as follows.

[Figure: an operator chain before and after the modification]

We have also introduced a new serializer, RowVectorSerializer. When two adjacent Flink operators can both be offloaded, the channel between them is configured to use RowVectorSerializer, allowing direct RowVector data transfer and avoiding row-column conversion overhead. Moreover, within the same node, two operators will only pass a RowVector pointer without serialization or deserialization.
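The difference between the two transfer paths can be illustrated with a toy example. Everything here is an assumption made for illustration: `ToyRowVector` is a single-column stand-in for a real RowVector, and `VectorChannel`, `LocalVectorChannel`, and `SerializingVectorChannel` are hypothetical names that do not reflect the actual RowVectorSerializer or Flink's serializer interfaces.

```java
import java.nio.ByteBuffer;

/** Toy columnar batch with one BIGINT column; not the Velox/Bolt RowVector. */
final class ToyRowVector {
    final long[] values;

    ToyRowVector(long[] values) {
        this.values = values;
    }
}

/** Hypothetical channel between two offloaded operators. */
interface VectorChannel {
    ToyRowVector transfer(ToyRowVector batch);
}

/** Same-node hand-off: the receiver gets the very same object, with no copy. */
final class LocalVectorChannel implements VectorChannel {
    @Override
    public ToyRowVector transfer(ToyRowVector batch) {
        return batch;
    }
}

/** Cross-node hand-off: the column is written as one contiguous block, never row by row. */
final class SerializingVectorChannel implements VectorChannel {
    static byte[] serialize(ToyRowVector batch) {
        ByteBuffer buf = ByteBuffer.allocate(Integer.BYTES + Long.BYTES * batch.values.length);
        buf.putInt(batch.values.length);      // row count header
        buf.asLongBuffer().put(batch.values); // bulk copy of the whole column
        return buf.array();
    }

    static ToyRowVector deserialize(byte[] bytes) {
        ByteBuffer buf = ByteBuffer.wrap(bytes);
        long[] values = new long[buf.getInt()];
        buf.asLongBuffer().get(values);       // bulk read back into columnar form
        return new ToyRowVector(values);
    }

    @Override
    public ToyRowVector transfer(ToyRowVector batch) {
        return deserialize(serialize(batch));
    }
}
```

The point of the sketch is that neither path converts the batch into individual rows: the local path hands over a reference, and the serializing path copies the column buffer as a whole.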

zhanglistar · 2025-12-12T03:25:01Z

@frankobe What do you think?

frankobe · 2025-12-15T23:29:01Z

Maintainer

It would be beneficial to join forces on the Flink acceleration plan. A RocksDB-based stateful operator has already been implemented internally. @zhanglistar @lgbo-ustc Feel free to comment in https://docs.google.com/document/d/1gNf-9VJEyMw1icEh3UnNsGwoGYuZu3qC3RIChWeS3F4/edit?tab=t.0#heading=h.34xj7176473v

@luozenglin and @yangzhg, should we merge #23 here?

1 reply

yangzhg Dec 18, 2025
Collaborator

> should we merge the #23 here

OK. That should be a better way to cooperate with the community.

liuyongvs · 2026-03-27T07:23:42Z

Hi @frankobe @lgbo-ustc, what is the current progress?

1 reply

frankobe Apr 16, 2026
Maintainer

@liuyongvs We are rolling out Flink-on-Bolt (project: Blint) for stateless and stateful Flink SQL jobs internally. The current plan is to open-source Blint as a subproject under Bolt in 2026 Q3 to Q4.
