Comments (7)
In datafuse, processing `number % 3` is slow. The main reason is that a type conversion is performed every time. When I remove the type conversion, performance goes from about 2G/s to 5G/s.
from arrow2.
#252 shows the main reason for the cast.
I did some profiling in datafuse, and it seems `rem` is the hot path.
Using `strength_reduce` shows a great improvement in datafuse.
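For readers unfamiliar with the idea, here is a minimal sketch of strength reduction for `%` by a scalar divisor. It is not the `strength_reduce` crate's API: `ReducedU32` and `rem_scalar` are hypothetical names, and the trick used (precompute `ceil(2^64 / d)` once, then take each remainder with two multiplications and a shift) is the well-known fast-modulo construction for 32-bit operands, chosen here only because it fits in a few lines.

```rust
/// Hypothetical helper (NOT the `strength_reduce` API): a divisor whose
/// reciprocal is precomputed once, so each `%` avoids a hardware division.
#[derive(Clone, Copy, Debug)]
pub struct ReducedU32 {
    divisor: u32,
    // ceil(2^64 / divisor); wraps to 0 when divisor == 1, which still
    // yields the correct remainder (always 0).
    magic: u64,
}

impl ReducedU32 {
    /// Does the expensive setup once. Panics if `divisor` is zero.
    pub fn new(divisor: u32) -> Self {
        assert!(divisor != 0, "divisor must be non-zero");
        let magic = (u64::MAX / divisor as u64).wrapping_add(1);
        Self { divisor, magic }
    }

    /// Computes `n % divisor` with two multiplications and a shift.
    #[inline]
    pub fn rem(self, n: u32) -> u32 {
        // Fractional part of n / divisor, scaled by 2^64.
        let low_bits = self.magic.wrapping_mul(n as u64);
        // Scaling the fraction by the divisor recovers the remainder.
        ((low_bits as u128 * self.divisor as u128) >> 64) as u32
    }
}

/// Applies `% divisor` to every value, paying the setup cost only once.
pub fn rem_scalar(values: &[u32], divisor: u32) -> Vec<u32> {
    let reduced = ReducedU32::new(divisor);
    values.iter().map(|&v| reduced.rem(v)).collect()
}
```

The gain comes from hoisting the setup out of the loop: an array-by-scalar `rem` kernel divides by the same value for every element, so the reciprocal only needs to be computed once per call.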
This is a valid request: valid use-case, documented benefits, crate with implementation available. 👍
Would you like to work on it, or would you like me to take it?
Also, I went through the crate and it is `unsafe`-free, so it is an even easier sell.
The same optimization can be used for division as well, right?
> The same optimization can be used for division as well, right?

Yes! Of course.

> Would you like to work on it, or would you like me to take it?

Sorry, I may not have time to work on it right now, because there are some urgent issues I have to handle in datafuse.
For now I am just offering this idea to make arrow2 work better.
Also, `strength_reduce` is not implemented generically, so it may need a `match` over the types to dispatch to the right code.
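The dispatch mentioned above could look roughly like the sketch below. Everything in it is hypothetical: `RemScalar`, `Column`, and `rem_column` are illustration-only names, not arrow2 or `strength_reduce` items, and the per-type kernel uses a plain `%` as a placeholder where a type-specific strength-reduced divisor would be built.

```rust
/// Hypothetical trait: one entry point, one specialised kernel per type.
pub trait RemScalar: Copy {
    fn rem_scalar(values: &[Self], divisor: Self) -> Vec<Self>;
}

// A macro stamps out one impl per integer type, since the reduced
// divisor types are assumed to be concrete rather than generic.
macro_rules! impl_rem_scalar {
    ($($t:ty),*) => {$(
        impl RemScalar for $t {
            fn rem_scalar(values: &[Self], divisor: Self) -> Vec<Self> {
                // Placeholder kernel: a real version would construct the
                // type-specific reduced divisor here, once per call.
                values.iter().map(|&v| v % divisor).collect()
            }
        }
    )*};
}
impl_rem_scalar!(u8, u16, u32, u64);

/// Hypothetical stand-in for a dynamically typed column.
#[derive(Debug, PartialEq)]
pub enum Column {
    U8(Vec<u8>),
    U16(Vec<u16>),
    U32(Vec<u32>),
    U64(Vec<u64>),
}

/// Matches on the runtime type and forwards to the typed kernel.
/// Returns `None` if the divisor is zero or does not fit the column type.
pub fn rem_column(col: &Column, divisor: u64) -> Option<Column> {
    if divisor == 0 {
        return None;
    }
    Some(match col {
        Column::U8(v) => Column::U8(u8::rem_scalar(v, u8::try_from(divisor).ok()?)),
        Column::U16(v) => Column::U16(u16::rem_scalar(v, u16::try_from(divisor).ok()?)),
        Column::U32(v) => Column::U32(u32::rem_scalar(v, u32::try_from(divisor).ok()?)),
        Column::U64(v) => Column::U64(u64::rem_scalar(v, divisor)),
    })
}
```

The `match` happens once per column rather than once per value, so the dispatch cost stays out of the hot loop.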
Done in #275