Comments (6)
@shahrukhx01 When I created this ticket and now I encountered many different ways to solve it. Each have some pros and some cons. Ideally I would like to build this feature such that we can plug any framework, because it directly linked to production deployment. Currently our target to build good user base who try Obsei and once it build then we can think about it that is the reason for lowest priority.
Few things we considered -
- Ray
- temporal
- airflow
- networkx
- containers (similar to what Jina does)
- Loky for local parallelism
- Pathos for local level parallelism
- Apache Beam (Pipeline IO concept can be borrowed) https://beam.apache.org/documentation/io/built-in/
If you have any suggestion regarding how we can abstract it then it would be great.
I am adding @GirishPatel to this conversation as well.
from obsei.
I see, I will explore orthe options and add to the alternatives you mentioned. Maybe before that, I can first finish #75 and focus on medium/high priority issues first then.
from obsei.
Our aim to build NLP based open source workflows like https://github.com/n8n-io/n8n
Currently we supporting text but there is plan to support image and audio based workflows.
We are very far from it. But hoping one day we can achieve this target
from obsei.
@lalitpagaria for this point "Introduce DAG based workflow. Need to finalise between networkx or airflow" please do let me know when you plan to start this implementation, as I'm seriously interested in working on this feature, as it'd also help me understand Haystack's codebase. thanks!
from obsei.
eventually, we'll get there!
from obsei.
Closing this and moving discussion to #145
from obsei.
Related Issues (20)
- Explore NetworkX to create complex workflows
- Add regex based function in TextCleaner
- Add paragraph and sentence boundary based splitting capability in TextSplitter HOT 2
- Question: How to deal with large source results HOT 3
- [BUG] Google News only return 100 query even if max_results is set at 1000 HOT 4
- [BUG] Facebook source failing with unexpected keyword `long_term_token`
- Integrate Freshdesk, Salesforce and SAP
- [BUG] Import issue on Python 3.7 version
- Make StrongCopyleft dependencies optional HOT 1
- [Observer] Youtube comments integration HOT 1
- google Colab getting "no module named 'dateparser'" HOT 11
- ModuleNotFoundError: No module named 'torch' HOT 5
- [BUG] TypeError: '<' not supported between instances of 'datetime.datetime' and 'NoneType' HOT 5
- [BUG]TwitterSourceConfig - AttributeError: At least one non empty parameter required (query, keywords, hashtags, and usernames) HOT 5
- [Observer] Add Youtube Transcript support
- [BUG] Map Review observer not honouring cutoff date
- More granular dependency division to choose analyzer dependencies
- Tiyaro API integration for analyzer HOT 1
- Fix obsei website
- OpenAI GPT3 integration as analyzer
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from obsei.