palashkulsh / dbcrawler
Crawls the database and makes insert statements for all the tables so that some data can be taken from all tables.
actor_info.actor_id=actor.actor_id,film.film_id=film_actor.film_id is valid
but
actor_info.actor_id=actor.actor_id, film.film_id=film_actor.film_id is not valid. This is because white space is not trimmed while parsing.
Column-wise filtered dump creation may be supported.
The main reason dbcrawler gets stuck is that each generated input is checked/deep-equaled against every other input generated so far. This acts as a failsafe against circular queries and prevents the same insert statement/data from being generated twice or more in the final output.
For example, if row 1 of table a is generated twice because of some dependency, two identical insert statements would be generated, and that would cause a conflict (duplicate keys) when inserting.
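A cheaper alternative might be to key each generated row by table and primary-key value and track those keys in a Set instead of deep-equaling everything; a minimal sketch, assuming every crawled row exposes a primary key (seen, rowKey and shouldEmit are hypothetical names, not dbcrawler's current code):

// Minimal sketch: de-duplicate by a table + primary-key string instead of
// deep-equaling every generated input against every other one.
const seen = new Set();

function rowKey(table, primaryKeyValue) {
  return table + ':' + String(primaryKeyValue);
}

function shouldEmit(table, primaryKeyValue) {
  const key = rowKey(table, primaryKeyValue);
  if (seen.has(key)) {
    return false; // this row was already dumped, skip the duplicate insert
  }
  seen.add(key);
  return true;
}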
If a single select retrieves too many values, the generated insert statement becomes too long, since it is built as "insert into table (columns...) values (first set of values), (2nd set of values), (3rd set of values), ..." and so on. A long enough list of values makes the SQL oversized and throws an error.
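One way around this could be to cap the number of value tuples per statement and emit several smaller inserts; a minimal sketch, assuming the mysql package is available for escaping (buildInserts and the batch size of 1000 are made up for illustration):

// Minimal sketch: split one oversized multi-row INSERT into several smaller ones.
const mysql = require('mysql');

function buildInserts(table, columns, rows, batchSize) {
  batchSize = batchSize || 1000; // arbitrary cap on value tuples per statement
  const statements = [];
  for (let i = 0; i < rows.length; i += batchSize) {
    const values = rows
      .slice(i, i + batchSize)
      .map(row => '(' + row.map(v => mysql.escape(v)).join(', ') + ')')
      .join(', ');
    statements.push('insert into ' + table + ' (' + columns.join(', ') + ') values ' + values + ';');
  }
  return statements;
}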
The crawler is not able to read the generated schema file from the /tmp directory because the sql module is not present there; it can read the schema file only from inside a scope where sql is available.
[Error: not able to read generated schema file Error: Cannot find module 'sql']
There can be issues with dbcrawler not working on Windows systems; the apparent reasons still have to be pinned down.
Scout for more Windows-related issues.
Running dbcrawler on one of the servers gave a "not able to parse generated schema file" error.
When the auto-gen folder is inaccessible, there should be a fallback location where the schema can be generated; one possible location is the /tmp/ directory.
dbcrawler breaks if there is @ or ) somewhere in the password
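One possible cause, assuming the credentials are currently packed into a user:password@host style connection string, is that characters like @ break the string parsing; passing a plain config object to the driver sidesteps that (a sketch only, not dbcrawler's current code, with placeholder values):

// Minimal sketch: pass the credentials as an object so characters like @ or )
// in the password are never parsed out of a connection string.
const mysql = require('mysql');

const config = {
  host: 'localhost',        // placeholder values
  port: 3306,
  user: 'root',
  password: 'p@ss)word',    // special characters are fine here, taken verbatim
  database: 'sakila'
};

const connection = mysql.createConnection(config);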
White space in seed data such as "actor.actor_id=3;actor.actor_id=4;film_actor.actor_id=5" throws an error in dbcrawler. For example, actor.actor_id=3;actor.actor_id=4; film_actor.actor_id = 5 will throw an error because white space is not considered while parsing the seed data.
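A minimal sketch of whitespace-tolerant seed parsing, trimming each token before splitting it further (parseSeedData is a hypothetical name, not an existing dbcrawler function):

// Minimal sketch: tolerate spaces around ';', '=' and '.' in the seed string.
function parseSeedData(seed) {
  return seed
    .split(';')
    .map(part => part.trim())
    .filter(part => part.length > 0)
    .map(part => {
      const [column, value] = part.split('=').map(s => s.trim());
      const [table, field] = column.split('.').map(s => s.trim());
      return { table, field, value };
    });
}

// parseSeedData('actor.actor_id=3;actor.actor_id=4; film_actor.actor_id = 5')
// => [ { table: 'actor', field: 'actor_id', value: '3' }, ... ]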
The version in commander should be taken from the actual version inside the package.json file; only then would the version have some meaning.
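A minimal sketch of wiring the real version through, assuming a recent commander release that exposes program:

// Minimal sketch: read the version from package.json instead of hard-coding it.
const { program } = require('commander'); // assumes commander >= 7
const pkg = require('./package.json');

program.version(pkg.version);
program.parse(process.argv);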
Defaults in commander and other places should be removed, since they won't run on any system except mine (the database they point to isn't installed elsewhere), and all the config parameters must be made compulsory.
Paths to the grammar files present in lib are being resolved incorrectly, as lib is being resolved to the currently available lib path. Maybe using ./lib/grammar files will solve the problem.
there must be some sort of logging to show progress.
Something was choking the crawler when given the input
'sales_data.order_id=350465135112'
Look into it.
Along with the host and other database information, provide a port option too, so that if the database is available at a non-standard port like 3307 instead of 3306, the user can supply it.
dbcrawler should be able to give a full dump of a table when a full dump of that table is requested.
The generated sql file is not runnable because it lacks the semicolon after individual insert queries.
Showing work progress on the screen assures the user that something is being done and that the program has not hung.
Should dbcrawler support providing data or not? This is crucial to the future of the crawler.
Provide CLI support for the following items.
Giving input on the command line as [-c "" ] gives the error:
message: 'Parse error on line 1:\n\n^\nExpecting 'STRING_LIT', got 'EOF'',
hash:
{ text: '',
token: 'EOF',
line: 0,
loc: { first_line: 1, first_column: 0, last_line: 1, last_column: 0 },
expected: [ "'STRING_LIT'" ] } }
In some places util.log is used, in other places console.log. What a mess. One of them is better than the other, but either way logging should be done consistently.
For powering collection APIs, dbcrawler will need collection-like support, that is, joining a table and filtering the data that is being retrieved from that table.
when running dbcrawler the output must be the dump of only the given seed data.
Two things should be kept in mind.
Currently, to run the crawler we have to pass/change the parameters in the file itself; a feature to pass changeable parameters from the command line would make running the program easier.
So database parameters like host, password, database, and user can be passed from the command line.
Crawling constraints can also be allowed from the command line, though passing them there would require an extra parsing step or some other way to specify them.
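A minimal sketch of what those command-line parameters could look like with commander (option names are illustrative, not dbcrawler's existing flags, and requiredOption assumes a recent commander release):

// Minimal sketch: all database parameters become compulsory CLI options,
// crawling constraints stay optional since they need an extra parsing step.
const { program } = require('commander'); // assumes commander >= 7

program
  .requiredOption('--host <host>', 'database host')
  .option('--port <port>', 'database port, e.g. 3307 for a non-standard setup', '3306')
  .requiredOption('--user <user>', 'database user')
  .requiredOption('--password <password>', 'database password')
  .requiredOption('--database <database>', 'database name')
  .option('-c, --constraints <constraints>', 'crawling constraints, e.g. "film.film_id=film_actor.film_id"')
  .parse(process.argv);

const options = program.opts();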
Currently all the queries are stored in the finaldata variable in memory. This hogs memory considerably, so it is better to write each query as it is encountered, or push the queries into a queue that pops each one and writes it into the file as it comes.
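A minimal sketch of streaming each statement to disk as it is produced instead of buffering it all in finaldata (writeQuery is a hypothetical helper, dump.sql a placeholder path):

const fs = require('fs');

// Minimal sketch: append each query to the dump file the moment it is generated.
const out = fs.createWriteStream('dump.sql', { flags: 'a' });

function writeQuery(query) {
  out.write(query + ';\n'); // the trailing semicolon also keeps the dump runnable
}

// call out.end() once crawling finishes so the file is flushed before the process exits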
Currently there are multiple duplicate entries for a single row; ideally there should not be.
process.exit is not called at the end, so dbcrawler does not exit after finishing successfully.