Giter Club home page Giter Club logo

stream-processing-with-apache-flink's Introduction

Hi ๐Ÿ‘‹ I'm Giannis

and I'm an Architect & Trainer focusing on Streaming Data.

Passionate for Event Streaming Systems, Stateful Stream Processing and Streaming Lakehouses

  • ๐Ÿ”ญ Working @ Ververica
  • ๐ŸŒฑ Iโ€™m focusing on Apache Flink and Apache Paimon.
  • ๐Ÿ“ I write articles on medium and from time to time @ Rock The JVM Blog
  • ๐Ÿ’ฌ Ask me about Apache Flink, Apache Paimon and Streaming Data Systems.
  • ๐Ÿ“ซ How to reach me https://www.linkedin.com/in/polyzos/

stream-processing-with-apache-flink's People

Contributors

polyzos avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

stream-processing-with-apache-flink's Issues

A question about the Docker networking setup

Hi there- thank you so much for this repo, and the forthcoming book! I'm looking forward to it.

I want to ask a question about how the networking is working to clarify my understanding on the Flink cluster setup. Please let me know if GitHub is the wrong place to ask this question, and if there is anywhere else that I could ask it!

I understand that the JobManager has two ports exposed to the host machine, 8081 for the web interface and 9249 for Flink internal metrics reporting. Each of the TaskManager containers then map from 9249 internally to 9250 and 9251 respectively to similarly report metrics to the host.

But don't the two TaskManager containers also have to communicate with each other (as per the documentation for standalone clusters, and as shown in this deep dive)?

Many thanks again, and apologies if this is a straightforward question!

Missing kafka environment variables

When I tried kafka got this error
ERROR ==> Kafka haven't been configured to work in either Raft or Zookeper mode. Please make sure at least one of the modes is configured.

To solve I had to add KAFKA_CFG_PROCESS_ROLES: controller,broker environment variable

unable to create topics in Redpanda

Hi there

I started up the Docker environment and everything came up fine from what I can tell. However when I try to execute the redpanda setup script I get the following message.

unable to create topics [accounts]: unable to dial: dial tcp 127.0.0.1:9092: connect: connection refused

I also tried running the rpk cluster info from the container and received the following.

unable to request metadata: unable to dial: dial tcp 127.0.0.1:9092: connect: connection refused

I'm running on an Intel based MacBook Pro and using colima in place of Docker Desktop, if that makes a difference.

Thank you

Lookup join example is potentially wrong

Hi,

I'm going through the book and I think the lookup join query is incorrect.

SELECT
    transactionId,
    t.accountId,
    t.eventTime_ltz,
    TO_TIMESTAMP_LTZ(updateTime, 3) AS updateTime,
    type,
    amount,
    balance,
    districtId,
    frequency
FROM transactions AS t
    JOIN accounts FOR SYSTEM_TIME AS OF t.eventTime_ltz AS a
    ON t.accountId = a.accountId;

As it is mentioned in the intro, a processing time attribute is required, however, in the example an event time field is used. On the Flink Dashboard it is also visible that the executed query is doing a temporal join. Switching to a field which is defined as PROCTIME() turns the query into a lookup join.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.