Comments (10)
The PDF is almost ready, please give me a few more weeks.
The building workflow is ready, but I need to finish the stylesheets and restructuring the chapters.
If you're impatient, the instructions are here https://github.com/stas00/ml-engineering/tree/master/build - but as I said the stylesheet needs a bit more work.
from ml-engineering.
@stas00 Thanks for making this happen! I had a good read!
from ml-engineering.
That's an excellent idea, Zhangzhi.
If it can be automated for sure I'd be happy to receive such a PR.
Have you checked whether the github market perhaps already has an app to do that automatically?
from ml-engineering.
Agreed, @pengzhangzhi. A PDF copy would be nice. For locally converting markdown to a PDF, pandoc
seems pretty well known. However, in terms of GitHub automation for this, solutions seem a bit sparse. Perhaps the easiest way to do this would be to add a small blurb at the end of this project's README.md that instructs readers who would like to make a PDF copy of this repo to run the following commands mentioned in this post.
from ml-engineering.
Awesome! Let's do it step by step. I will first write a script to generate pdf locally and then figure out ways to automate this by GitHub automation.
from ml-engineering.
the grip recipe won't work well since this is a multi-file situation. the SO answer @amorehead linked to mentions https://wkhtmltopdf.org/downloads.html which is probably a much better tool for building pdfs from a set of files. So probably need to convert to html first with pandoc and then wkhtmltopdf that.
from ml-engineering.
I would consider to build the pdf through Latex.
In terms of advantages, its rendering is best; it is also way more stable and consistent than html-based solutions (weasyprint and wkhtml2pdf have very different looking results, wkhtml2pdf has not been supported for a while, it's a mess).
In terms of inconvenient its install size is terrible. It's not also technically 'reproducible build' unless you follow a similar approach to what is done by https://github.com/hendricius/the-sourdough-framework (even without that it's much better than the html-based alternatives). Their pdf result (https://www.the-bread-code.io/book.pdf) is kind of perfect too.
from ml-engineering.
Thanks! How to convert markdown to latex... Any framework?
from ml-engineering.
the pdf is finally done:
https://github.com/stas00/ml-engineering#pdf-version
from ml-engineering.
Thank you for validating that it is a good read, @pengzhangzhi
from ml-engineering.
Related Issues (18)
- Parallel training hangs HOT 10
- Daisy chain batch jobs HOT 1
- Improve folder structure HOT 3
- Convert to bfloat16 failing HOT 2
- pip install -r build/requirements.txt fails due to github_md_utils HOT 3
- Clarification for gradient memory in mixed precision training HOT 3
- Quarto Site HOT 3
- Conflicting opinions about streaming data from cloud storage? HOT 2
- discuss the solutions to Not fully recovering spikes HOT 7
- ML
- TPU v4 has 1,200GB/s of mem bandwidth and not 2,400, right? HOT 1
- Question about changing precision post training HOT 2
- Question about the right hidden dim when using SwiGLU HOT 3
- Missing `hparams` section HOT 2
- Adding another logbook (kinda) HOT 1
- Minor Typo in emulate multi node HOT 4
- GPU requirements and cost estimation. HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ml-engineering.