Comments (4)
Resolved itself? : )
Anyways, your problem is a critical point. We are currently improving the error messages when dealing with custom input data. The code hidden in the parent Processor class can be sometimes confusing for people new to FARM. You can visually inspect your data processing when we log samples to the console. Or you go in debug mode and see how the data flows, especially along the functions defined here.
Normally this error comes from input data not in the right format. Formatting issues occur frequently when dealing with comma separated files and a text column often containing the separator or newlines (or any other wierd string symbols).
So please hang in there and ask questions through our github issues page here.
from farm.
Hi, sorry accidentally pressed enter before i was done typing :D I have updated my comment from earlier.
Thanks for your support so far, will look into it tomorrow again!
from farm.
Hey @amoelle
Thanks for updating the issue. Could you try renaming the "Tonalität des Artikels/Themas" column to anything but "text_classification_label". This is our internal string value, that gets assigned to label_name. If that solves the issue, we will need to add a check for that very case.
Otherwise could you verify with self.df['Tonalität des Artikels/Themas'].unique()
that really all values occuring in that column are covered by the label list?
Thanks, looking forward to your reply tomorrow.
from farm.
Renaming the label column seems to work, thank you! :)
from farm.
Related Issues (20)
- MTL Processor QA + Classification HOT 1
- Querying API Docker examples HOT 1
- Should be possible to use the proper aggregated loss for early stopping HOT 3
- AdaptiveModel.convert_to_onnx does not save float16 model conversion to output_path HOT 1
- ONNXAdaptiveModel causes NameError: name 'onnxruntime' is not defined HOT 1
- Error reporting using other pre training models HOT 2
- how to predict on single data points for classification problem.? HOT 2
- Error Importing Inferencer HOT 4
- Retreiver Fine Tuning : Are language models like roberta, gpt2 supported to use in retreiver? HOT 3
- Can't train a language model HOT 4
- Max token size? HOT 2
- summarization HOT 1
- Combine several models into one with several prediction heads HOT 2
- Need a guidance on Multi label Classification HOT 2
- Extract embedding while using parameter "extraction_strategy="per_token"" HOT 1
- Which pytorch (and other package) versions are actually required HOT 4
- Current version `0.8.1-snapshot` is not valid according to PEP 440 and causes installation problems HOT 1
- Columns and DataType Not Explicitly Set on line 147 of wordembedding_utils.py
- Installation error
- IndexError: too many indices for array: array is 2-dimensional, but 3 were indexed
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from farm.