Giter Club home page Giter Club logo

ml_movie's Introduction

ML_Movie

README

Introduction

Welcome to this GitHub repository! This project utilizes the latest GPT-4 model from OpenAI to generate high-quality datasets, focusing on detailed descriptions of each frame in movie scenes, including the structure of the scene, camera angles, actors, emotions, and plot elements.

Important Updates

  • Adoption of the New GPT-4 Model: We've shifted to the newly released GPT-4 model by OpenAI, resulting in significant improvements in the quality of our datasets.
  • Code Updates: All related codes for dataset generation have been updated and are now centralized in man.py.
  • Discontinuation of Previous Installation and Setup Processes: We are in the process of revising our installation guidelines.

Accessing Dataset and Models

If you wish to access our dataset and the fine-tuned Stable Diffusion model, please visit our Hugging Face repository. (Replace # with the actual URL to your Hugging Face repository)

Model Demonstrations

Below are demonstrations of the results from our fine-tuned model compared to the non-fine-tuned model:

testing text:

"The image provided appears to be an underwater shot, presumably of the bow of the RMS Titanic as it lies on the ocean floor. Unfortunately, due to the darkness and the low resolution in the image, specific details are not clearly visible. In scenes such as this in the movie ""Titanic,"" characters would typically be situated in a submarine or remotely operated vehicle (ROV), exploring the wreckage of Titanic. The spatial relationship between characters during these explorations would typically involve the characters inside the submersible looking out at the shipwreck and directing the ROV to maneuver around the ship's remains. The subtitle, ""Okay, take her up and over the bow rail,"" suggests that someone is instructing a remote operator or a pilot of a submersible to navigate the vehicle in a specific manner around the ship's wreckage. This instruction would mean that the vehicle should ascend and move over the bow rail of the ship's wreckage, which is a part of the ship's structure at the front. The spatial relationship in this context involves the remote vehicle moving in the water relative to the massive, stationary wreck of the Titanic. This movement is part of the explorative plot in the movie where the characters are investigating the remains of the sunken ship. In the scene, elements that would typically be in the foreground include the illumination from the vehicle's lights and possibly parts of the vehicle itself. In the background, you would usually see parts of the shipwreck bathed in the eerie glow of the submersible's lights, giving the audience a sense of the ghostly, haunting atmosphere of the deep ocean and the tragedy that occurred there. The layout enhances the emotional impact of the film by conveying a sense of exploration, historical intrigue, and the somber reality of the Titanic's fate."

Fine-Tuned Model

Fine-Tuned Model Fine-Tuned Model

Non-Fine-Tuned Model

Non-Fine-Tuned Model Non-Fine-Tuned Model

Installation Instructions

The new installation guide is currently being developed. Depending on your computer's environment, you may need to add various database dependencies. We will provide a detailed installation and configuration guide as soon as possible.

Dataset Changes

  • Discontinuation of MovieNet Database: We've completely abandoned the MovieNet database as the datasets generated with the GPT-4 model are far superior in clarity and accuracy.
  • Improved Clarity and Accuracy: The data generated by the GPT-4 model is not only clearer but also more precise in descriptions.

Branch Information

  • Demo Branch: You can view the old code in the Demo Branch.
  • Main Branch: The new version of the code will be continuously refined and iterated in the Main Branch.

Usage

  1. Clone the repository to your local machine.
  2. Ensure Python and the required libraries are installed (installation guide is being updated).
  3. Run man.py to generate the dataset as per the instructions.

Contribution

Contributions to this project are welcome, either through submitting Pull Requests to improve the code or functionality, or by submitting new ideas or bug reports through Issues.

ml_movie's People

Contributors

5418xr avatar sk204478 avatar zzhiyuan59 avatar

Watchers

 avatar

ml_movie's Issues

Review Project Papers & Devise Detailed Solutions

Title: Review Project Papers & Devise Detailed Solutions


Issue Description:

To ensure a comprehensive understanding and alignment on our project, it's crucial for every member to read the related papers associated with our project. Once acquainted, please draft detailed solutions for the segments you're responsible for.


Detailed Instructions:

  1. Paper Review: Make sure you've gone through all the relevant papers connected to our project. Should there be any queries or if you require any additional literature, kindly communicate with the team promptly.

  2. Solution Formulation: On grasping the contents of the papers, chalk out detailed solutions for your designated segment. Ensure your solutions are both specific and actionable, aligning with our project objectives.

  3. Share & Discuss: Be prepared to share and discuss your proposed solutions on Discord from 1 PM to 2 PM this Sunday. Ensure your availability during this window and be ready to share.


Expected Outcome:

  • A deep-rooted understanding of the project among all team members.
  • Actionable solutions from each member tailored to the project's requirements.
  • A finalized approach post-discussion, ensuring everyone is on the same page.

Deadline: Kindly complete the above tasks and be present for the discussion on Discord between 1 PM to 2 PM this Sunday.

Looking forward to an active participation from all and ensuring the success of our project. Thanks for your collaboration and effort!


@ALL

Meeting Summary

  1. Dataset Exploration
    Over the next week, the team will focus on the movienet and other similar datasets. The objective is to identify reliable and usable data examples.
  2. Progress Report
    We will have a progress update on Thursday. Team members are requested to ensure that their respective tasks are completed and be prepared to report their progress by then.
  3. Demo Showcase
    We now have a demo based on minGPT4 that can read images and provide details about characters and scenes in the image. I'll be uploading it to GitHub later today. Team members are encouraged to test and provide feedback on this demo.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.