legacyai / tf-transformers Goto Github PK
View Code? Open in Web Editor NEWState of the art faster Transformer with Tensorflow 2.0 ( NLP, Computer Vision, Audio ).
License: Apache License 2.0
State of the art faster Transformer with Tensorflow 2.0 ( NLP, Computer Vision, Audio ).
License: Apache License 2.0
This is great work!!! I have problem with TF2+HF with too many errors, reported to TF2, I aim to switch to tf-transformers. Though library did not work in colab, I guess there are some missing files? Thanks.
Would it be possible to add bert2gpt conversion? Thanks.
You said this library is 90 times faster than HF transformers, but there is no benchmark about it.
https://github.com/legacyai/tf-transformers/tree/main/benchmarks
I was reading the code for the HF GPT2 benchmark, and it seems like key-value caching is not being used? This is pretty important for any kind of autoregressive generation and would greatly speed up the decoding time. HF models have had support for key-value caching for a while, see config arguments use_cache
and past_key_values
here: https://huggingface.co/docs/transformers/model_doc/gpt2#transformers.GPT2LMHeadModel.
I think it would be important for this project to re-benchmark the HF models with key-value caching enabled, as that is standard practice and without it the HF numbers are being handicapped.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.