Automatic profiling of social media users from the analysis of unstructured data. Early is a Django web application for building datasets, correcting data and validating data from the automatic profiling of social media users from the analysis of unstructured data. It also allows you to visualize social media profiles, with the predicted demographic data and the automatically filled Beck Depression Inventory.
- Create datasets:
- Source data: Reddit
- Configure the experiments by subreddit, number of users, number of comments per user, etc.
- Profile retrieved data by consuming Profiling Buddy API
- Create corpus to classify the datasets
- Export data:
- Demographic data
- Dataset
- Labeled dataset
- JSON or CSV format
- Profile user by Reddit username
- View profiles list
- Search and filter profiles
- View profile detail
- Edit and validate profile
- Correct Beck Depression Inventory questionnaire
Check the OpenAPI specification for more information regarding the API.
Run as a container with:
- Docker
- docker-compose
Clone the project, install docker and start the service with docker-compose up
For create the database super user run the following command docker exec -it <early_web_container_id> python manage.py createsuperuser
- Retrieve data from Twitter
- Improve search engine
- Manage user groups (a user group can access a subset of profiles, not all profiles)
- Add statistics and data visualization of the demographic data of the profiles
- i18n, right now only available in English and Spanish
GNU GPLv3.0