Comments (3)
As far as I can tell, the Repair AV workflow was causing this data transfer. My understanding of this problem is as follows:
- There were 24 videos in the queue from the admin-upload process, and the workflow doesn't commit the ids of completed videos if it is killed unexpectedly partway through.
- Some of the videos were over 1 GB in size, and the workflow process was killed every time while transcoding one of these, presumably because of the file size.
- Because the workflow was scheduled hourly, a new run would start (often before the previous run had even failed) and begin working on the exact same videos until it, too, failed on the very large files.
For now, I have gone through the 24 un-transcoded videos in S3 and put the ids of those that are >= 1 GB in size into a new txt file, which the Repair AV script will read in order to skip those videos. Ultimately, we'll need to compress and re-upload those particular videos.
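A rough sketch of the skip-list idea, in case it helps anyone following along. This is not the actual Repair AV script: the function names, the one-id-per-line file layout, and the `sizes_by_id` mapping are all assumptions made for illustration, and "1 GB" is taken here to mean 1024³ bytes.

```python
from pathlib import Path

# Assumption: the ">= 1 GB" threshold is interpreted as 1 GiB.
ONE_GB = 1024 ** 3


def oversized_ids(sizes_by_id, limit=ONE_GB):
    """Return the ids whose size in bytes is at or above the limit, sorted."""
    return sorted(vid for vid, size in sizes_by_id.items() if size >= limit)


def write_skip_file(path, ids):
    """Write one video id per line (hypothetical file layout)."""
    Path(path).write_text("".join(f"{vid}\n" for vid in ids))


def load_skip_ids(path):
    """Read the skip file; a missing file means nothing is skipped."""
    p = Path(path)
    if not p.exists():
        return set()
    return {line.strip() for line in p.read_text().splitlines() if line.strip()}


def select_videos(pending_ids, skip_ids):
    """Keep the queue order, dropping any id listed in the skip file."""
    return [vid for vid in pending_ids if vid not in skip_ids]
```

The sizes themselves would have to come from wherever the object metadata is available (for example an S3 listing); that part is left out here on purpose.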
from covid-histories.
@mukeshchugani10 One change to the code that would help with issues like this in the future would be a modification to the workflow script so that the updated list of video ids is committed to the repo each time a video is processed successfully. As far as I can tell, this commit currently happens only after all of the videos have been processed, which is ultimately what caused this particular issue.
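To make the suggestion concrete, here is a minimal sketch of per-video checkpointing. It is not the existing workflow script: `process_queue`, the JSON state file, and the `on_checkpoint` hook are hypothetical names, and the hook is only a placeholder for whatever actually commits and pushes the updated id list to the repo.

```python
import json
import os
import tempfile


def load_completed(state_path):
    """Return the set of ids already processed; empty if no state file yet."""
    try:
        with open(state_path) as fh:
            return set(json.load(fh))
    except FileNotFoundError:
        return set()


def save_completed(state_path, completed):
    """Write the id list atomically so a kill mid-write can't corrupt it."""
    directory = os.path.dirname(os.path.abspath(state_path))
    fd, tmp_path = tempfile.mkstemp(dir=directory, suffix=".tmp")
    with os.fdopen(fd, "w") as fh:
        json.dump(sorted(completed), fh)
    os.replace(tmp_path, state_path)


def process_queue(video_ids, transcode, state_path, on_checkpoint=None):
    """Transcode each pending video, recording progress after every success."""
    completed = load_completed(state_path)
    for vid in video_ids:
        if vid in completed:
            continue  # finished in an earlier run
        transcode(vid)  # may raise, or the whole process may be killed here
        completed.add(vid)
        save_completed(state_path, completed)
        if on_checkpoint is not None:
            on_checkpoint(vid)  # assumption: this is where the commit would go
    return completed
```

With this shape, a run that dies on the fifth video still leaves the first four recorded, so the next hourly run starts at the fifth instead of from the beginning.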
We can set up a separate ticket/workflow for handling very large (> 1 GB) video files, though arguably it would be better if files of this size were never uploaded to S3 in the first place and we had some preprocessing step instead. Will figure that out with the next batch of uploads.