hoxu / gitstats Goto Github PK
View Code? Open in Web Editor NEWgit history statistics generator
Home Page: http://gitstats.sourceforge.net/
git history statistics generator
Home Page: http://gitstats.sourceforge.net/
gitstats . stats
[0.43809] >> gnuplot --version
Output path: /home/johannes/Dokumente/Universität/Informatik/Master/12ss/Masterarbeit/stats
Loading cache...
Git path: .
Collecting data...
git shortlog -s HEAD | wc -lerror: Could not read b7924b0488d62b9aecaead77ae47608fa172ef46
fatal: Failed to traverse parents of commit 32780257da003ca294d825abdc2d5226c824e62f
[0.01169] >> git shortlog -s HEAD | wc -l
[0.00487] >> git show-ref --tags
git rev-list --pretty=format:"%at %ai %aN <%aE>" HEAD | grep -v ^commiterror: Could not read b7924b0488d62b9aecaead77ae47608fa172ef46
fatal: Failed to traverse parents of commit 32780257da003ca294d825abdc2d5226c824e62f
[0.01512] >> git rev-list --pretty=format:"%at %ai %aN <%aE>" HEAD | grep -v ^commit
git rev-list --pretty=format:"%at %T" HEAD | grep -v ^commiterror: Could not read b7924b0488d62b9aecaead77ae47608fa172ef46
fatal: Failed to traverse parents of commit 32780257da003ca294d825abdc2d5226c824e62f
[0.01445] >> git rev-list --pretty=format:"%at %T" HEAD | grep -v ^commit
[0.02945] >> git ls-tree -r -l -z HEAD
git log --shortstat --first-parent -m --pretty=format:"%at %aN" HEADerror: Could not read b7924b0488d62b9aecaead77ae47608fa172ef46
fatal: Failed to traverse parents of commit 32780257da003ca294d825abdc2d5226c824e62f
[3.14722] >> git log --shortstat --first-parent -m --pretty=format:"%at %aN" HEAD
git log --shortstat --date-order --pretty=format:"%at %aN" HEADerror: Could not read b7924b0488d62b9aecaead77ae47608fa172ef46
fatal: revision walk setup failed
[0.01042] >> git log --shortstat --date-order --pretty=format:"%at %aN" HEAD
Refining data...
Saving cache...
Traceback (most recent call last):
File "/usr/local/bin/gitstats", line 1424, in
g.run(sys.argv[1:])
File "/usr/local/bin/gitstats", line 1405, in run
data.refine()
File "/usr/local/bin/gitstats", line 573, in refine
self.authors_by_commits = getkeyssortedbyvaluekey(self.authors, 'commits')
File "/usr/local/bin/gitstats", line 78, in getkeyssortedbyvaluekey
return map(lambda el : el[1], sorted(map(lambda el : (d[el][key], el), d.keys())))
File "/usr/local/bin/gitstats", line 78, in
return map(lambda el : el[1], sorted(map(lambda el : (d[el][key], el), d.keys())))
KeyError: 'commits'
When running gitstats, I am seeing an IndexError: list index out of range error. This is the first time running gitstats on this repo.
[[0.08800] >> git cat-file blob 42468d8ff605ff955b875916bfc80acdf95fe1c4 | wc -l
2.71800] >> git cat-file blob b2c62966c30359a015256e3f513bac08331438c0 | wc -l
Traceback (most recent call last):
File "gitstats.py", line 1490, in <module>
g.run(sys.argv[1:])
File "gitstats.py", line 1465, in run
data.collect(gitpath)
File "gitstats.py", line 518, in collect
ext_blob_linecount = pool.map(getnumoflinesinblob, blobs_to_read)
File "C:\Python27\Lib\multiprocessing\pool.py", line 251, in map
return self.map_async(func, iterable, chunksize).get()
File "C:\Python27\Lib\multiprocessing\pool.py", line 558, in get
raise self._value
IndexError: list index out of range
When I run this git cat-file
command, this is what is returned:
C:\projects\sandbox>git cat-file blob d469d018cece658a71ced1c7cf3cfffce2d24f9d | wc -l
97
C:\projects\sandbox>git cat-file blob 61f9e75f9b63ded44c5d3fc9a9269bb86bb647bd | wc -l
161
C:\projects\sandbox>git cat-file blob 88e064f90313a614abd6b3a2081bcb21c0539cb2 | wc -l
45
C:\projects\sandbox>git cat-file blob 7180af8494c4dae57cad082b42277298b00b8669 | wc -l
331
C:\projects\sandbox>git cat-file blob 49ffd7368bc396cd10dbf3e7439b7d0f296010cf | wc -l
97
C:\projects\sandbox>git cat-file blob f0ce1e6845fff2a765cad458e987fc06f083b3f7 | wc -l
250
C:\projects\sandbox>git cat-file blob df0fc164e6b33bbc382d5a6c51c002199b551834 | wc -l
26
C:\projects\sandbox>git cat-file blob a6912c1194cc19ce52e58ca03de5ded92835eb7b | wc -l
32
C:\projects\sandbox>git cat-file blob 42468d8ff605ff955b875916bfc80acdf95fe1c4 | wc -l
109
C:\projects\sandbox>git cat-file blob b2c62966c30359a015256e3f513bac08331438c0 | wc -l
96
C:\projects\sandbox>git cat-file blob d7ac3f820842c8eed99882cdba9b89d729070415 | wc -l
107
C:\projects\sandbox>git cat-file blob 9f3794ba017d460ad9ef77004f95755bc4817ffb | wc -l
76
C:\projects\sandbox>git cat-file blob f558f59e09d9e181a6a35a08092ed46ed3380916 | wc -l
236
C:\projects\sandbox>git cat-file blob f5f7834784f05eb7291cf54152b84996dcbe7b7b | wc -l
11
C:\projects\sandbox>git cat-file blob 69eaececede172cf1029c7c0c1d7adef505b924d | wc -l
89
Any chance you could create a 'release' or tag with the current stable version?
The 'homebrew' project uses these to help with installation.
There's been a couple open requests for this over time, and it would behelpful to the users at large.
Homebrew/legacy-homebrew#25447
Homebrew/legacy-homebrew#14418
Thanks!
git shortlog -s HEAD | wc -l
^CTraceback (most recent call last):
File "gitstats/gitstats", line 1432, in
g.run(sys.argv[1:])
File "gitstats/gitstats", line 1409, in run
data.collect(gitpath)
File "gitstats/gitstats", line 262, in collect
self.total_authors += int(getpipeoutput(['git shortlog -s %s' % getcommitrange(), 'wc -l']))
File "gitstats/gitstats", line 58, in getpipeoutput
output = p.communicate()[0]
File "/usr/lib/python2.6/subprocess.py", line 693, in communicate
stdout = self.stdout.read()
KeyboardInterrupt
Started with outputpath = local git checkout.
cd Downloads/gitstats-master && ./gitstats /Users/X/git/myrepo .
Error observed:
Refining data...
Saving cache...
Generating report...
Traceback (most recent call last):
File "./gitstats", line 1490, in <module>
g.run(sys.argv[1:])
File "./gitstats", line 1477, in run
report.create(data, outputpath)
File "./gitstats", line 744, in create
shutil.copyfile(src, path + '/' + file)
File "/usr/local/Cellar/python@2/2.7.15_3/Frameworks/Python.framework/Versions/2.7/lib/python2.7/shutil.py", line 83, in copyfile
raise Error("`%s` and `%s` are the same file" % (src, dst))
shutil.Error: `/Users/XYZ/Downloads/gitstats-master/gitstats.css` and `/Users/XYZ/Downloads/gitstats-master/gitstats.css` are the same file
Perhaps it would help to either check on startup (a) if the local directory is the "source" directory, and/or (b) if the resources that gitstats
will to deploy already exist? On big Git repositories, this would save the user from re-starting the whole process.
gnupolt not found problem, though had configured gnuplot environment variable(means I can get gnuplot version when type in 'gnuplot --version'),
So I solved this by delete the three lines, and It worked
I have to use ProcessExplorer to "Resume" the process ... until that GitStats sleeping :-O
It looks like there are a large number of PRs unanswered. Is there a fork we should be using instead?
File "C:\Python27\lib\multiprocessing\forking.py", line 489, in prepare
file, path_name, etc = imp.find_module(main_name, dirs)
ImportError: No module named gitstats
As of pull #35 merging author information is now handled by .mailmap.
Yet developers use the file for two distinct reasons:
That leads to companies getting smaller attribution in the "Commits by Domain" section.
Sadly we cannot correct every project to use .mailmap 'correctly', yet we want to give credit where it's due.
I would like to address that and the quickest way is to revert the above PR.
Any objections, alternative suggestions how to address the issue?
in Collect revision statistics
lines = getpipeoutput(['git rev-list --pretty=format:"%%at %%ai %%aN <%%aE>" %s' % getlogrange('HEAD'), 'grep -a -v ^commit']).split('\n')
<<
grep breaks the PIPI with: Binary file (standard input) matches.
Adding -a to grep solves the issue.
It's been brought to my attention that Fedora 31 will be going Python3 only:
https://fedoraproject.org/wiki/Changes/F31_Mass_Python_2_Package_Removal
https://bugzilla.redhat.com/show_bug.cgi?id=1737997
As it stands this will result in the removal of gitstats from the Fedora repositories, though I am investigating how to avoid this (by e.g. carrying patches to make it python3 compatible).
I thought I'd ask however if there was any plan with regards to supporting python3 for gitstats, as while Fedora is likely pushing the curve here other distributions may follow suit in future? As far as I can tell this last came up back in 2015 as distros started defaulting /usr/bin/python to python3:
I've installed Python 2.7.
I've installed gnuplot.
I have git.
I've added python path, gunplot path and the folder where gitstats sit into the system path.
running "python gitstats git_repo_path out_path" chucks a couple of lines, and then runs into an endless loop of missing and failed imports.
here's a screenshot: http://i.imgur.com/QHwpgU0.png
any suggestions ?
Hello there,
just if anybody tried to run gitstats with OS X El Capitan, for me these steps worked:
brew install gnuplot
sudo ln -s /usr/bin/python2.7 /usr/local/bin/python2
make install
in repository root (xcode and developer tools are required)Folloing instructions from http://gitstats.sourceforge.net/, I tried to install gitstats
with command
brew install --HEAD homebrew/head-only/gitstats
But got this error:
Error: homebrew/head-only was deprecated. This tap is now empty and all its contents were either deleted or migrated.
Readme needs documentation on how to actually use it
On ubuntu 12.04, when I use the command : gitstats ../wsb/code/sud-ouest-france ~/gitstat
the processing starts, then :
Traceback (most recent call last):
File "/usr/bin/gitstats", line 1383, in
g.run(sys.argv[1:])
File "/usr/bin/gitstats", line 1360, in run
data.collect(gitpath)
File "/usr/bin/gitstats", line 414, in collect
size = int(parts[3])
ValueError: invalid literal for int() with base 10: '-'
After running :
gitstats -c start_date="2016-01-01" . ../stats
I get this error :
File "/usr/local/bin/gitstats", line 57 print '>> ' + ' | '.join(cmds), ^ SyntaxError: invalid syntax
Hi All,
I need statistics for commits,lines in all branches in specific repository.
I saw FAQ in man gitstats , but it seems not work, it still generate the report for only the main branch.
gitstats -c commit_end='web' -c start_date='2017/1/1' cosmos-main-html5 gitstats
Can anybody help on this ?
FAQ
Q: How do I generate statistics of a non-master branch?
A: Use "-c commit_end=web" parameter.
regards,
Jonas
Whenever an author name contains , they are output as-is in gnuplot files. gnuplot then runs the command that they contain, making it extremely unsafe to run gitstats on untrusted repositories, where author names could contain
malicious command``.
The doc/license file states that the licensing of the project is as follows:
License is GPLv2/GPLv3, see doc/GPLv2 and doc/GPLv3 respectively.
The header of the sortable.js file has this statement:
Table sorting script by Joost de Valk, check it out at http://www.joostdevalk.nl/code/sortable-table/.
Based on a script from http://www.kryogenix.org/code/browser/sorttable/.
Distributed under the MIT license: http://www.kryogenix.org/code/browser/licence.html .
Copyright (c) 1997-2007 Stuart Langridge, Joost de Valk.
The license file should be updated to more accurately reflect the licensing for the project. For consistency the name of the file should also be LICENSE (all caps).
[0.26964] >> git rev-list --pretty=format:"%at %ai %aN <%aE>" HEAD | grep -v ^commit
Traceback (most recent call last):
File "/usr/bin/gitstats", line 1490, in
g.run(sys.argv[1:])
File "/usr/bin/gitstats", line 1465, in run
data.collect(gitpath)
File "/usr/bin/gitstats", line 339, in collect
author, mail = parts[4].split('<', 1)
ValueError: need more than 1 value to unpack
In my case, The %aN is "Shinichiro Nakamura", so "parts[4].split('<', 1)" was failed.
'wc' is not recognized as an internal or external command,
operable program or batch file.
Original title: Error while running gitstats with opensuse kernel
I've cloned the following repository
git://gitorious.org/opensuse/kernel.git
While running ./gitstats /path/to/git/repo /output/folder I face this error
"[2.11899] >> git shortlog -s "v3.0.78" "^v3.9.1"
[1.76242] >> git shortlog -s "v3.4.45" "^v3.0.78"
[1.95104] >> git shortlog -s "v3.8.13" "^v3.4.45"
[0.57330] >> git shortlog -s "v3.9.2" "^v3.8.13"
[0.69536] >> git shortlog -s "v3.10-rc1" "^v3.9.2"
[1.95851] >> git shortlog -s "v3.2.45" "^v3.10-rc1"
[6.77937] >> git shortlog -s "rpm-3.0.74-0.6.8" "^v3.2.45"
[0.03402] >> git shortlog -s "v3.0.79" "^rpm-3.0.74-0.6.8"
[1.77202] >> git shortlog -s "v3.4.46" "^v3.0.79"
[2.23483] >> git shortlog -s "v3.9.3" "^v3.4.46"
[0.71021] >> git shortlog -s "v3.10-rc2" "^v3.9.3"
[12.44785] >> git rev-list --pretty=format:"%at %ai %aN <%aE>" HEAD | grep -v ^commit
Traceback (most recent call last):
File "./gitstats", line 1430, in
g.run(sys.argv[1:])
File "./gitstats", line 1407, in run
data.collect(gitpath)
File "./gitstats", line 312, in collect
author, mail = parts[4].split('<', 1)
IndexError: list index out of range
"
FYI
a) I repeated the process more than 3 times and the error still appears.
b) With other repositories I had no issue or any error while running ./gitstats.
Hi,
The result of commit by authors only display top 20, can it display all authors?
I would like to add support for excluding one or more authors, the main use case I have for this is that the "Jenkins" user comes up as the top author in the statistics because it is the author of all the merge commits.
Hello,
It’d be great if you could tag a stable release so we can use the GitHub-generated tarball instead of relying on the git HEAD
commit.
Thanks!
I think commit 780c0fd (start_date
command line option) introduced a bug that broke commit_begin
command line option.
def getlogrange(defaultrange = 'HEAD', end_only = True):
commit_range = getcommitrange(defaultrange, end_only)
if len(conf['start_date']) > 0:
return '--since=%s %s' % (conf['start_date'], commit_range)
return commit_range
The way how getlogrange
is implemented and called in other parts means commit_begin
has no effect.
I propose the following fix:
start_date
and commit_begin
command line options mutually exclusive - only one of them can be defined. Error will be raised if both options are present.getxxxrange
so that all combinations of start_date
, commit_begin
and end_only
are handled correctly.Any plans into migrating the code over to Py3?
It's now possible to "yum install gitstats" on Fedora 17 and later and EL6 distributions that have the EPEL repository enabled [1]. Wondering if we can get a shoutout to this on the web site:
http://gitstats.sourceforge.net/
Thanks,
Steve
In the common workflow, we mostly need stats for actual implementation commits, i.e we don't want to include merge commits in the git reports.
Can we add a param to exclude merge commits from git statistics --no-merges
Refining data...
Saving cache...
Generating report...
Traceback (most recent call last):
File "./gitstats", line 1490, in <module>
g.run(sys.argv[1:])
File "./gitstats", line 1477, in run
report.create(data, outputpath)
File "./gitstats", line 744, in create
shutil.copyfile(src, path + '/' + file)
File "/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/shutil.py", line 69, in copyfile
raise Error("`%s` and `%s` are the same file" % (src, dst))
shutil.Error: `/Users/samm/othergit/gitstats/gitstats.css` and `/Users/samm/othergit/gitstats/gitstats.css` are the same file
Hi,
Thanks for gitstats.
I use git for the sources of a website dedicated to teaching. Years begin in september and end in august.
How easy it would be to add a conf option to set a starting month that would make Gistats report years as 2013-2014? Could you give some pointers?
Hi,
I wrote a git dynamic repository visualization (see examples at: https://github.com/joe42/git_treemap).
I would like to integrate git treemap with gitstats. If you are interested, I will write a suitable pull request.
Momentarily, git treemap produces an html file with a dynamic treemap relying on the InfoViz JavaScript Toolkit. To this end, the git repository statistics are efficiently extracted with GitPython and inserted as JSON data into an html template with the help of Python's Template module.
Please let me know if you are interested to integrate git treemap.
Thanks for writing and maintaining gitstats,
joe
I am getting the following error when I try to execute
./gitstats -c start_date="$(date +%Y-%m-%d -d "3 month ago")" "/repository/<repo_name>/.git" "/repository/gitresult"
Error:
[0.00403] >> git shortlog -s --since="2018-05-29" "HEAD" | wc -l
[0.00518] >> git show-ref --tags
[0.00352] >> git rev-list --pretty=format:"%at %ai %aN <%aE>" --since="2018-05-29" "HEAD" | grep -v ^commit
Traceback (most recent call last):
File "./gitstats", line 1490, in
g.run(sys.argv[1:])
File "./gitstats", line 1465, in run
data.collect(gitpath)
File "./gitstats", line 338, in collect
timezone = parts[3]
IndexError: list index out of range
Surprisingly, It works when I change the date to 4 month ago. I have no idea why it the error is occurring. Is it something related to different commits in repository?
Currently, if someone commits a large configuration file or some third-party code, it distorts the stats as well as the associated graphs. Is there a way to blacklist certain files or paths?
Lines of Code (with names removed)
Thanks,
-Anthony
P.S. I work in robotics, so large configuration files are not uncommon (e.g. a list of GPS way-points for a testing area)
I'd suggest plots using the "with steps" style rather than "with lines", so that large jumps e.g. in lines of code that come after a long no-commit period show up as a vertical jump following a horizontal line, rather than a diagonal line.
I try to run gitstats with Python 2.7.1 installed, however it says:
env: python2: No such file or directory
Would be nice to get the raw output as XML so it can be easily aggregated into a database.
http://gitstats.sourceforge.net/
repository is hosted on both GitHub and Gitorious.
-> Gitorious is closed, the link is dead.
Commits after a user changes his/her nickname are treated by commits by another user.
Hi,
I was unable to use gitstats on Windows (unless I run it through cygwin).
It complains that wc
command is unrecognized, which is expected as it's not present on Windows:
C:\Program Files>python C:\git\gitstats\gitstats C:\git\DscPublisher C:\git\GitS
tatsOutputDscPublisher
[0.03100] >> gnuplot --version
Output path: C:\git\GitStatsOutputDscPublisher
Git path: C:\git\DscPublisher
Collecting data...
'wc' is not recognized as an internal or external command,
operable program or batch file.
[0.03100] >> git shortlog -s HEAD | wc -l
Traceback (most recent call last):
File "C:\git\gitstats\gitstats", line 1490, in <module>
g.run(sys.argv[1:])
File "C:\git\gitstats\gitstats", line 1465, in run
data.collect(gitpath)
File "C:\git\gitstats\gitstats", line 289, in collect
self.total_authors += int(getpipeoutput(['git shortlog -s %s' % getlogrange(
), 'wc -l']))
ValueError: invalid literal for int() with base 10: ''
One of the lines which uses wc
is:
self.total_authors += int(getpipeoutput(['git shortlog -s %s' % getlogrange(), 'wc -l']))
When I tried running gitstats through cygwin, everything worked, but I don't see it listed as requirement anywhere, so I'd assume it should be running on Windows without it as well.
Hello
I'm currently writing my master thesis and use gitstats to track progress on my writing.
I'm missing two metrics to do so a little bit better.
a) Commits per day.
Showing a graph of the last 30 (maybe 90) days with the number of commits.
b) Word count
As I make notes, the document gets many lines, but as I reduce the codes to text, the number of lines reduces, so the graph isn't really helpful anymore. What I would need is the growing number of words.
Thanks for the nice project :)
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.