Comments (3)
Reject meaning zero out or don't output the vector entirely? This for cases like outliers or one hot encoded enums where the aggregator doesn't have an entry?
from featran.
Not zero, but some separate output indicating some error, e.g. out of bound, outlier. Users can choose to keep the output (likely zeroed out) or filter the entire records out if error exists.
We can represent FeatureBuilder
output something like a Try[T]
or cats Validation
.
from featran.
I see, makes sense. Exposing errors and allowing people to decide what to do with them is always a good path. If nothing else there will be cases where the function that is passed to the transformer will hit malformed data.
from featran.
Related Issues (20)
- Can we use scaladoc 2.12? HOT 2
- Add documentation site with paradox HOT 1
- Add Scala Binary Compatibility validation tool – "MiMa"
- Performance issue in TensorFlow FeatureBuilder HOT 1
- PositionEncoder doesn't support input as "Seq" of Strings HOT 3
- Feature transformations order lost after filtering on a MultiFeatureSpec HOT 1
- Use JsonSerializable typeclass for FlatReader[String] and FlatWriter[String]
- Upgrade TensorFlow to 1.9.0 HOT 3
- FlatExtractor performance. HOT 1
- Add java api for FlatConverter & FaltExtractor
- `featran` root artifact published by mistake
- Switch xgboost to official release package
- Is featran thread-safe and can be intergrated in akka
- Sequential composition of transformers HOT 2
- sbt `release skip-tests` not skipping tests? HOT 1
- Implementing feature transformer when the Aggregator and the Transformer input are of different types HOT 9
- Add dotty cross-compile support
- Can't mix `featran-xgboost` dependency with newer versions of xgboost
- Update TensorFlow to >=2.3.1 HOT 1
- Could you help upgrade the vulnerble dependency in featran?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from featran.