Comments (6)
odbc2parquet 3.1.0 has been released. It emits a log message at info level stating the number of rows fetched so far.
from odbc2parquet.
Currently it is entirely possible to use odbc2parquet in situations where the first files have already been written while even the data source does not yet know how many rows the query will yield. odbc2parquet has no idea what the maximum number of rows is. I am not confident that every data source would support count_big or that it would always be cheap to call.
Emitting the total number of rows fetched so far, on the other hand, is easy. I see nothing that speaks against it.
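To illustrate the idea of reporting rows fetched so far without knowing the final total, here is a minimal sketch in Python. It is not odbc2parquet's implementation: the function name `fetch_batches` is hypothetical, and an in-memory SQLite database stands in for an ODBC data source.

```python
import logging
import sqlite3

logging.basicConfig(level=logging.INFO)
log = logging.getLogger("fetch_progress")


def fetch_batches(cursor, batch_size):
    """Yield batches from a DB-API cursor, logging the running row count."""
    total = 0
    while True:
        batch = cursor.fetchmany(batch_size)
        if not batch:
            break
        total += len(batch)
        # Only the count so far is known; the final total is not.
        log.info("Fetched %d rows so far", total)
        yield batch


# Assumption: in-memory SQLite stands in for the real data source.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE t (id INTEGER)")
conn.executemany("INSERT INTO t VALUES (?)", [(i,) for i in range(10)])
cursor = conn.execute("SELECT id FROM t")
batch_sizes = [len(b) for b in fetch_batches(cursor, batch_size=4)]
```

The counter needs nothing from the data source beyond the batches themselves, which is why it works for arbitrary queries.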
Yeah, the count thing would only make sense with a special "full table copy" mode, where you're not passing a query but a table name.
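A rough sketch of what such a table copy mode could look like, assuming the caller passes a table name rather than a query. Everything here is hypothetical: `copy_table_with_progress` is an illustrative name, plain `COUNT(*)` is used instead of `count_big`, and SQLite again stands in for the data source.

```python
import sqlite3


def copy_table_with_progress(conn, table, batch_size):
    """Count rows up front, then yield (batch, fraction_done) pairs."""
    # Table names cannot be bound as parameters, so validate before formatting.
    if not table.isidentifier():
        raise ValueError(f"unexpected table name: {table!r}")
    total = conn.execute(f"SELECT COUNT(*) FROM {table}").fetchone()[0]
    cursor = conn.execute(f"SELECT * FROM {table}")
    done = 0
    while True:
        batch = cursor.fetchmany(batch_size)
        if not batch:
            break
        done += len(batch)
        # The fraction is only exact if the table does not change mid-copy.
        yield batch, done / total


# Assumption: in-memory SQLite stands in for the real data source.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE t (id INTEGER)")
conn.executemany("INSERT INTO t VALUES (?)", [(i,) for i in range(10)])
fractions = [f for _, f in copy_table_with_progress(conn, "t", batch_size=4)]
```

The extra count query is the additional assumption about the data source: it has to be supported and cheap enough to run before the copy starts.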
I have thought a bit about how to do a progress bar, but there is no good way to retrieve the total number of rows without making additional assumptions about the data source in question. It is the age-old tradeoff between usefulness and reusability.
However, I could consider making the core functions of this CLI tool available as a library, allowing you to build a specialized CLI tool for your use case on top. I would only invest the effort if this is something you would actually consider using, though. Otherwise I am also fine with closing the issue and calling it a day.
Best, Markus
Ok! I don't think I would use that library, given that I'm much more fluent in Python and there are various easy-to-use ways to fetch data from SQL Server and write to Parquet in Python.
Thanks for the feedback, closing the issue.