mkranj / paperscited Goto Github PK
View Code? Open in Web Editor NEWList all unique citations in your document
License: GNU General Public License v3.0
List all unique citations in your document
License: GNU General Public License v3.0
When citing works by multiple authors the "and" separator in different languages leads the program to conclude those are separate citations. This could occur e.g. in the English abstract of a croatian article.
"A and B, 2000" and "A i B, 2000" should be logged as one citation. Preferably "A i B, 2000" since Croatian matches mean the article is probably in Croatian.
Both "A and B, 2000" and "A i B, 2000" get recorded as separate citations.
Names with the "ç" don't get recorded as parts of citations.
The text includes the following sentence: "Mendonça i suradnici (2009) utvrdili su..."
The Excel file should contain "Mendonça i sur. 2009"
Excel contains "sur. 2009"
If any .txt file is loaded, a warning pops up that ANSI encoding is preferred.
If the encoding of the file can be determined, show the warning only if it is not already ANSI.
In the sentence Cohen's (1999) seminal paper..., an article by Cohen (1999) is cited and should be recorded as such. Currently, the citation appears as Cohen's 1999, instead of Cohen 1999.
This could also lead to duplicate citations being recorded if both the possessive and regular form occur.
The software does not pick up unpublished papers cited as Author(s) (in press) or Author(s) (u tisku - in Croatian).
Certain phrases in the format of word, long number get recognised as citations, e.g. Washington, DC, 10694(000).
The mentioned text doesn't get written in Excel.
The mentioned text returns a citation "DC 1069"
Currently, no text in footnotes is analyzed. So, if it contains additional citations, they won't be recorded in the output file.
The program closes after finishing gathering citations for a given file. Offer an option to analyze another file instead of closing.
Some texts contain lots of single words followed by a string of numbers, which falsely get detected as authors.
Limiting first authors to only uppercase words would avoid this issue. This applies to solo authored works, pairs, and threes.
The text includes the following sentence: "Review of psychology (1234-5678-1234)..."
The Excel file should not contain anything from this sentence.
Excel contains "psychology 1234"
Misspeling author names will prevent the program from recognising them.
.Csv should consist of two columns, narrower and wider citations, with no empty columns provided.
Authors can be reffered to in different cases in Croatian (padeži). For example, Cohen, Cohenu, Cohenom. Possesive forms also alter the words - Cohenov. These all refer to the same source.
Cohen's (1999) paper is the same as the one referenced by Cohen (1999).
Recognise different word endings in citations. If two citations differ only in the last letter, keep only one. Same for citations ending in 's.
Superov (1999) članak potaknuo je daljnju diskusiju. (...) Na kraju, nitko nije imao bolju ideju od Supera (1999).
With recognising case, the citation in the Excel file would be a single Super 1999.
Right now the program detects multiple sources when they are listed directly after one another. However, they are recorded in one cell, and possibly duplicate other citations when they reference just one of the years listed.
A function that separates such citations into individual ones would help promote readability and reduce duplicated sources. The sources should be listed as such in the reference list.
The text includes the following sentence: "AuthorA (2000; 2002; 2003) thoroughly explored..."
The Excel file should list "AuthorA 2000", "AuthorA 2002" and "AuthorA 2003" as separate citations.
Excel contains a citation "AuthorA 2000 2002 2003"
This requires #40 to be implemented first, because clipboard text will have no default filepath to save to.
Individual characters that are "allowed" are listed in the program.
Omission of characters such as æ, ø, å...
Import a list of Unicode characters (that doesn't include numbers, special symbols...) and use that for maximum coverage.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.