Hi , Currently we are using lookups-cached-global extension for loading lookups in

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

using druid maha lookups as a replacement for lookups-cached-global about maha HOT 4 CLOSED

vsharathchandra commented on May 20, 2024

using druid maha lookups as a replacement for lookups-cached-global

from maha.

Comments (4)

patelh commented on May 20, 2024

Of your 50-100 lookups, how many have the same key?

How long does it currently take to load the lookups?

You could convert your lookups to RocksDB based lookups where you create new snapshots once a day and publish updates via Kafka. This would require you to build a new RocksDB instance once a day, zip it up and publish it to HDFS. But it also means you would need some daemon process to do change data capture and publish the updated or new rows to Kafka.

In your 50-100 lookups, if many of your lookups share the same key, you could replace them with our JDBC lookup since it allows for multiple values to be loaded in one lookup, saving duplication of key space. E.g. lookups-cached-global you have one key to one value: Map(a -> aa, b -> bb) Map(a-> 123, b -> 456), our JDBC lookups allow for just one lookup : Map( a -> (aa, 123), b -> (bb, 456)). At query time, you just specific which column you want in the extraction function.

from maha.

vsharathchandra commented on May 20, 2024

We haven't properly monitored the loading time.For one large lookup(around 10 million entries) , it takes around 45 minutes.

from maha.

patelh commented on May 20, 2024

@vsharathchandra might be easier to talk about this on gitter or hangouts

from maha.

vsharathchandra commented on May 20, 2024

okay sure will contact you on gitter.

from maha.

using druid maha lookups as a replacement for lookups-cached-global about maha HOT 4 CLOSED

Comments (4)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent