Comments (5)
Oh, so do you mean adding an option to use Stanza? Hmm, I'm familiar with both Stanza and spaCy, but the biggest trouble for me would be dealing with Spanish texts. I only know Spanish at a very introductory level.
Anyway, I looked through the Entity Grid and TTR features, which both seem to require minimal Spanish skills. I'll first create a pull request (in a few days) for these files. I'll try to add an option to use Stanza rather than fully migrating to it, so one could then choose which library to use.
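A rough sketch of what such an option could look like for a TTR-style feature is below. The names `make_tokenizer` and `type_token_ratio` are hypothetical and not part of the current code base, and the spaCy model name `es_core_news_sm` is an assumed default. Both libraries are imported lazily, so only the selected one has to be installed.

```python
from typing import Callable, List


def make_tokenizer(
    backend: str = "spacy",
    lang: str = "es",
    spacy_model: str = "es_core_news_sm",  # assumed default model name
) -> Callable[[str], List[str]]:
    """Return a function that splits text into token strings.

    Hypothetical helper: the backend is chosen by name and imported lazily.
    """
    if backend == "spacy":
        import spacy

        nlp = spacy.load(spacy_model)
        return lambda text: [tok.text for tok in nlp(text)]
    if backend == "stanza":
        import stanza

        # Assumes the Stanza models for `lang` were downloaded beforehand,
        # e.g. with stanza.download(lang).
        nlp = stanza.Pipeline(lang=lang, processors="tokenize")
        return lambda text: [
            tok.text for sent in nlp(text).sentences for tok in sent.tokens
        ]
    raise ValueError(f"Unknown backend: {backend!r} (expected 'spacy' or 'stanza')")


def type_token_ratio(tokens: List[str]) -> float:
    """Type-token ratio over alphabetic tokens, case-insensitive."""
    words = [t.lower() for t in tokens if t.isalpha()]
    if not words:
        return 0.0
    return len(set(words)) / len(words)
```

With something like this, `type_token_ratio(make_tokenizer("stanza")(text))` and `type_token_ratio(make_tokenizer("spacy")(text))` would run the same feature on either backend, so the feature code itself would not need to know which library produced the tokens.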
from trunajod2.0.
Thank you for open-sourcing this repo! It's helping a lot with my research.
Regarding the Stanza migration, unless you have a tight deadline, I could help. However, I doubt the accuracy would improve by much. spaCy had a major improvement quite recently (https://spacy.io/usage/v3). But, of course, Stanza would look much better for research papers.
Hello Bruce! Sure, I don't have a tight deadline, so your contribution is more than welcome! There are some differences between the Stanza pre-trained models and the spaCy ones, so I am not sure about migrating completely, but having the alternative of using Stanza models instead of spaCy ones might improve performance in some cases!
I mean, initially I wanted to completely replace spaCy, but as you mentioned, spaCy has improved over time, so removing all the spaCy references may not be as good as having options for both Stanza and spaCy. No worries regarding the Spanish-related features. I can update them. BTW, thanks for offering to contribute!
No worries. I'm also working on a similar project, so it'll help me too anyway :)