Comments (4)
Just to add an MWE:
julia> using CSV, DataFrames
julia> df = DataFrame(rand(2_000_000, 4));
julia> CSV.write("test_out.csv", df);
julia> @time CSV.read("test_out.csv", DataFrame);
0.569546 seconds (154.47 k allocations: 75.956 MiB, 3.24% gc time)
julia> df = DataFrame(BigFloat.(rand(2_000_000, 4)));
julia> CSV.write("test_out.csv", df);
julia> @time CSV.read("test_out.csv", DataFrame);
629.696438 seconds (6.00 G allocations: 97.848 GiB, 23.51% gc time)
(@v1.5) pkg> st CSV
Status `C:\Users\ngudat\.julia\environments\v1.5\Project.toml`
[336ed68f] CSV v0.7.7
from parsers.jl.
Ok, merged #76, and the perf should be much, much better now.
Looks like a Parsers.jl issue, I'll dig in to see what we can do to make it faster:
julia> using BenchmarkTools, Parsers
julia> @btime parse(Float64, "0.8720837370748981687285095176775939762592315673828125")
452.393 ns (0 allocations: 0 bytes)
0.8720837370748982
julia> @btime Parsers.parse(Float64, "0.8720837370748981687285095176775939762592315673828125")
20.607 μs (654 allocations: 13.08 KiB)
0.8720837370748982
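To repeat this comparison on another machine or package version without BenchmarkTools, a minimal sketch along the following lines may help. It assumes Parsers.jl is installed; the helper name `time_parse` and the iteration count are illustrative and not from the thread, and the timings it prints will differ from the numbers above.

```julia
using Parsers

# Long decimal string of the kind produced when BigFloat values are written to CSV
const S = "0.8720837370748981687285095176775939762592315673828125"

# Hypothetical helper: average seconds per call of `f(Float64, s)` over `n` calls
function time_parse(f, s::AbstractString; n::Int = 10_000)
    f(Float64, s)  # warm-up call so compilation is not included in the timing
    t = @elapsed for _ in 1:n
        f(Float64, s)
    end
    return t / n
end

base_val = parse(Float64, S)
pars_val = Parsers.parse(Float64, S)

# Both parsers should return the same value; only the cost per call differs
@assert base_val == pars_val

println("Base.parse:    ", time_parse(parse, S), " s per call")
println("Parsers.parse: ", time_parse(Parsers.parse, S), " s per call")
```

The equality check is there so that a speed difference is not mistaken for a difference in results.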
Ok, PR finally up; it was a beast of a refactor, but the performance improvements are about 20x for these large number cases. #76