Giter Club home page Giter Club logo

Comments (3)

ltalirz avatar ltalirz commented on June 8, 2024 1

Below a point raised by @jeffhammond via email + a follow-up

Your citation counter uses https://scholar.google.com/scholar?q=NWChem%20Valiev&hl=en&as_sdt=0%2C5&as_ylo=2020&as_yhi=2020, which maps to "NWChem Valiev". Is there any particular reason why you only count NWChem citations associated with Marat Valiev? Are there false positives otherwise?

If you use "NWChem" alone, you see https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&as_ylo=2020&as_yhi=2020&q=NWChem&btnG= has 634 citations instead of 353. This is a nontrivial difference. I would be really surprised if any of these is not associated with the NWChem of interest.

If you want to be explicit about authors, "NWChem Windus OR de-Jong OR Kowalski OR Bylaska OR Apra OR Valiev OR Govind OR Harrison OR Straatsma" will get you 426 citations, with relatively high specificity.

The "code name" + "key author" approach is somewhat of a historical relict from the list created by Luca Ghiringhelli but there also is some thought behind it. While the "code name"-only approach would work for certain engines with highly unique names (NWchem is one of them), it would not work for other codes (think: ORCA, etc.).
Including author names in the queries for some codes but not for others would create a relative imbalance in the data set, e.g. in cases where the name of a code is mentioned in the text without a citation / author, or in cases where Google scholar hasn't fully indexed the references.

I think a first improvement over the "code name" + "key author" is indeed to include at least the first authors of recent review articles via OR, which I've now done for nwchem. However, I'm also open to discussing dropping the author from the query altogether, as well as any other suggestions for improving the general query approach.

P.S. One could argue that maintaining a list of review papers per code and simply summing up the corresponding citations would be superior to the current approach (even if then one will be double-counting some citations). The main drawbacks of this approach are that it significantly increases the maintenance burden, and that it would no longer be possible to direct users to the results of the query used to obtain the citation results (since multiple queries would be needed). If Google Scholar’s API ever adds support for combining multiple “cited by” queries into one, I will consider switching.

from atomistic-software.

sponce24 avatar sponce24 commented on June 8, 2024

Hello,

Very nice Leopold. Maybe EPW could be added (Google scholar 'EPW Giustino' returns 149 citations for 2020).

Thanks,
Samuel

from atomistic-software.

ltalirz avatar ltalirz commented on June 8, 2024

Thanks, Samuel! As you are one of the code authors, would you mind opening a PR to add EPW to https://github.com/ltalirz/atomistic-software/blob/master/src/data/codes.json? I think it would fit into the S=Spectroscopy category (that, by the way, is the least populated and least well-defined; probably misses out on a couple of other codes).
Or let me know which of the "tags" should apply in your view.

No need to add the citation numbers; I'll take care of that.

from atomistic-software.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.