Comments (16)
Hey @NirantK , in code examples can I add how one can use CountVectorizer and Tf-idfVectorizer? And how pre-trained vectors can be used in NLP?
from awesome-nlp.
Hey @anu0012 , if the vectorization methods specific to Dialogs and chatbots - feel free to add them to that section. I guess they are not.
If you are asking if you can add them in general, consider adding them to the tutorial section - if these are not already covered.
from awesome-nlp.
https://github.com/anu0012/Predict_the_happiness_challenge/blob/master/notebook.ipynb
In this notebook, I have used Tf-IDF Vectorizer. I used several concepts like text-cleaning, lemmatization, stemming etc. in this script. Can I add this?
from awesome-nlp.
No, @anu0012 that does not meet our requirements just yet. Please refer the tutorials section to get an estimate of the quality needed to be included here.
I am sure you can polish it to make it awesome and help the community in the process!
from awesome-nlp.
@NirantK, I first checked the links here.
- the RNNLM toolkit link is broken.
- The other papers have working links
New stuff worth adding
- SPMF , a Java library for pattern mining
- A Sequential pattern mining tutorial and a 'hands-on' thingy
- This code repo is dual LSTM encoder for dialog response generation from the Ubuntu corpus.
Anything I am missing out or mistaking for something?
from awesome-nlp.
from awesome-nlp.
I understand. 😄 . I mistook it for something. No problem, I will open up the needed issue and look more in the DialogCI and ubuntu corpus thing
from awesome-nlp.
https://www.tidytextmining.com
I think this can be added in reading section. What do you think @NirantK ?
from awesome-nlp.
@anu0012 good find. Since this is an entire book and not a one-off tutorial, let's create a new section under tutorials Books and add there.
This becomes our excuse to make some progress on #5 as well.
from awesome-nlp.
Thank you @the-ethan-hunt.
I have fixed the broken link and closed #105.
As a quick note, Dialogflow is a tool for making Human-Computer Interaction systems (or HCI). In layman words, it is a tool for making chatbots.
from awesome-nlp.
Hey @the-ethan-hunt, do consider continue contributing to awesome-nlp. Take a look at this issue if you'd like :)
from awesome-nlp.
Sure @NirantK ! But is there any other issue I might possibly work on? 😅
from awesome-nlp.
Sure @the-ethan-hunt.
Thanks for adding Korean from #98 but did not make enough progress on Chinese, Japanese or any European languages for that matter. It'd be awesome if we'd take that issue to its due conclusion.
It saves a lot of time for the community to have all of the best tools for a particular language in one place.
from awesome-nlp.
Hey @anu0012, are you still interested in working on this? We could really appreciate a hand here :)
from awesome-nlp.
Sure @NirantK. In the second point which you mentioned what type of code examples and dataset can be added?
from awesome-nlp.
@anu0012 Chatbots, virtual assistants and any other popular form of conversational interfaces is a good starting point.
E.g. there is some work on chatbots from Microsoft and Facebook both, check for what datasets they've used and if we can mention them here. Similarly, there is some work on intent detection etc, maybe look if that is relevant?
If at the end of all of this search, we are still unsatisifed with the quality and breadth of coverage, maybe we can merge this section with Conversational Q&A which has similar technical challenges imho. I'd be mostly going by your (and community's) recommendation and findings on the same.
from awesome-nlp.
Related Issues (20)
- Chazutsu is a Python library for reading standard text datasets HOT 2
- Participation in open source coding programs HOT 18
- Reproducing results of: A Decomposable Attention Model for Natural Language Inference for Hindi HOT 3
- Develop Hindi models for Tokenization,POS Tagging and NER HOT 5
- Reproduce Hierarchical Attention Networks for Document Classification for German HOT 4
- Adding research topic specific section HOT 2
- Traditional Chinese(zh-TW) language support HOT 2
- Add Research Labs HOT 2
- Another NLP tool HOT 2
- Suggestion
- Any support for Gujarati HOT 5
- Add standfordNLP (python) HOT 1
- Add the best state of the art reference HOT 1
- Which kind of model is better for keyword-set classification? HOT 3
- Broken link on readme.md HOT 1
- Nlp
- Grouping entites
- NLP in Japanese HOT 2
- Looking for a successor HOT 1
- Please add VideoDubber.ai for AI Video Translation
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from awesome-nlp.