Comments (11)
I definitely want to work on an updated version, just haven't had the time (pull requests are always welcome!). I'm hoping to take a stab at it this summer.
(Also, @sujato, the line breaks in the HTML are generated by Pandoc — I would never even write much HTML by hand anymore!)
from niddesa.
I'd be happy to get the data into Bilara once V2 is ready though.
Wow, that would be great. The only special consideration with putting it into the Bilara system is that the SuttaCentral policy is that those translations are CC0/public domain.
If you wanted a different license, then it would need to be added as html files into the legacy system. I think Bhante Sujato would be happy to have it added either way.
from niddesa.
I'm wondering if you have had time to work on this or if you see yourself having time in the near future. Just checking in.
from niddesa.
@thesunshade Yes! I've begun work on V2, partially based on this corpus (I still don't actually know Pali!). I haven't pushed that work yet due to its size (don't want to bloat the repo or deal with filter-branch). Right now I think I'm on track to have V2 ready to go sometime in October (a season later than I originally expected).
from niddesa.
Great to hear. Have you considered doing the translation using the Bilara software created for Sutta Central? That would allow the translation to be viewed (at the reader's discretion) side by side with the Pali. Any thoughts, Bhante @sujato?
from niddesa.
I took a look at Bilara, but I'm very comfortable with my current workflow (Vimmers get set in their ways). I'd be happy to get the data into Bilara once V2 is ready though.
from niddesa.
Hey just a quick update, V2 is nearly finished. I have about 140 passages that still need to be fixed, but then it'll be good to go, so probably by Sunday after next at the latest. I've also switched to CC0 (on the V2 branch). I'll probably end up doing a third major revision sometime when life slows down a bit and I'm better recovered from Covid, but V2 will be usable enough for most folks.
from niddesa.
Congratulations! And I wish you a speedy recovery.
Let me know what your interest is in putting the translation into Bilara.
from niddesa.
Thanks. I think it would be valuable to have in Bilara, though I didn't retain line numbers everywhere, due to rewording some passages and omitting some longer repetitions within the same chapters. Looks like the schema Bilara uses probably requires line numbers to be the same across translations? In the mean time, I could absolutely get the text into regular SC. I'm guessing that would mean the sc-data repo? I'm poking through the wikis right now trying to figure out the format, since I'm currently just generating a single long file for each filetype.
from niddesa.
Looks like the schema Bilara uses probably requires line numbers to be the same across translations?
Yes. You can see in bilara-data
how the translations are stored in json files with the keys being segment numbers. It's fine to have empty segments, though. This often happens with places with repetitions.
I'm guessing that would mean the sc-data repo?
Yes. My understanding is that the actual format of the html file is not so strict. I'm not sure if the segment numbers are even required in these texts. The technical term for these translations is "legacy translations".
Do you have an account on the SC discussion forum? I think that is usually how Bhante Sujato works with people on preparing translations.
from niddesa.
I'm going to close this because the work presumably has been taken over by someone else in the EBT community. There are more details in the sutta-central Discourse thread, but for others' reference: the main blocker for me was that I don't have time to reformat the whole text to align with the Pali verses, because I did this translation from a Chinese version that didn't preserve those verses and lines.
from niddesa.
Related Issues (7)
- Get ahold of the HTML from Sutta Central for Pali and Sinhala HOT 1
- Convert the raw HTML to Markdown for processing HOT 1
- Automated translate using Google Cloud Translate HOT 1
- Translation with Amazon Translate? HOT 1
- Clean up and combine auto translations HOT 1
- Write scripts to build for epub, pdf, html, etc HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from niddesa.