Comments (4)
hello, kenny, I may have time add a small update for this
but I have few questions
- what should default install command
pip install langcheck
do, would it only install support for default language, like [en] e.g. or let it install supoprt for all language? install former way seem easier to realized. - if split like this
dev = [
"yapf==0.40.1",
"flake8",
"isort",
"pyright",
"pandas-stubs",
"pytest",
"sphinx",
"sphinx-book-theme",
"myst-parser",
"ipykernel"
]
zh = [
'hanlp >= 2.1.0b33',
]
ja = [
'fugashi', # For tokenizer of metrics.ja.toxicity()
'janome >= 0.3.1',
]
the github action file may also need a small update like, pip install -e .[dev,zh,ja], e.g.
what's your opinion ?
from langcheck.
Hi @Vela-zz, that would be fantastic!
- what should default install command
pip install langcheck
do, would it only install support for default language, like [en] e.g. or let it install supoprt for all language? install former way seem easier to realized.
I think we can make pip install langcheck
install a default language (English) to improve the default installation process.
the github action file may also need a small update like, pip install -e .[dev,zh,ja], e.g.
Agreed, we'll need to update all GH Actions, including the new Pip Install Matrix (we might want to add a third matrix dimension for "language").
dev = [ "yapf==0.40.1", "flake8", "isort", "pyright", "pandas-stubs", "pytest", "sphinx", "sphinx-book-theme", "myst-parser", "ipykernel" ] zh = [ 'hanlp >= 2.1.0b33', ] ja = [ 'fugashi', # For tokenizer of metrics.ja.toxicity() 'janome >= 0.3.1', ]
@liwii do you know which dependencies in pyproject.toml are Japanese-only?
@ischender do you know which dependencies in pyproject.toml are German-only? (if any?)
I'm happy to help you with parts of this PR, e.g. I can update GitHub Actions, if you want.
from langcheck.
Also, not sure if this is possible, but it might be nice to have pip install langcheck[all]
which installs all languages without dev dependencies.
from langcheck.
That would be neat!
@kennysong , I am pretty sure the only German only part is that we download the translation model from Huggingface, but that happens when it's run the first time.
from langcheck.
Related Issues (20)
- Installing tensorflow breaks the transformers library in `test_factual_consistency()` HOT 2
- Fix `NOQA E501` showing up in documentation
- Add "refusal to answer" metric HOT 2
- Visualize thresholds when plotting MetricValueWithThresholds HOT 2
- Fix documentation page title for MetricValue
- Add versions to ReadTheDocs documentation HOT 1
- Support OpenAI >= 1
- Installation problem on Python 3.10 on Apple Silicon Mac
- Eliminate unexpected spaces introduced before periods by `TreebankWordDetokenizer().detokenize()`.
- Allow metrics to run on GPU
- Create langcheck.utils.detect_language() HOT 7
- Add docstrings for object properties so they show up in documentation
- Pin versions for HuggingFace models HOT 5
- Specify langcheck.__version__
- Consider computing German fluency without translating to English
- Improve testing of installation on different machines and python base images
- Document maximum allowed input lengths for each metric
- Follow up `use_async` option HOT 1
- Improve the stability of metrics by repeated queries
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from langcheck.