sandrabotica Goto Github PK
Name: Sandra Botica
Type: User
Bio: Data Analyst | Health & Physical Education Teacher | Pilates Instructor
Location: Perth, Australia
Name: Sandra Botica
Type: User
Bio: Data Analyst | Health & Physical Education Teacher | Pilates Instructor
Location: Perth, Australia
Module 14 Challenge involves building a dashboard to explore an external site with a microbe dataset. Horizontal bar chart, dropdown menu, bubble chart and metadata table of sample data. App deployed to a static hosting page.
Module 20 Challenge uses Supervised Machine Learning to predict loan risk of borrowers. Train-test-split, Logistic Regression Model, Confusion Matrix and Classification Report analysis.
Module 13 Challenge is a ETL pipeline using Python dictionary methods and regular expressions to extract and transform data. CSVs used to create an ERD table schema and uploaded to a Postgres database.
Module 19 Challenge uses unsupervised machine Learning to predict if Cryptopcurrencies are affected by 24hour or 7day price changes. Normalise data, K-Means Clustering, elbow method and PCA analysis.
Module 21 Challenge uses a Neural Network Model as a binary classifier to predict if a funding organisation will be successful in their application for funds based on features in a dataset. Compile, Train, Evaluate, Optimise the model.
Module 22 Challenge, Big Data, use of SparkSQL to determine key metrics about home sales data. Temporary views, partitioned data, cached and uncached a temporary table, and verifying table as uncached.
Module 15 Challenge uses Leaflet to visualise an earthquake dataset from USGS. TileLayer loads and connects to a geojson API using D3 and tooltip for each data point, scaled with magnitude and colour based on depth.
Supervised ML model implementation and optimisation for heart disease prediction. Logistic Regression, Random Forest and a Neural Network. Tableau visualisation of data.
Module 18 Challenge involves aggregating data from an external site, the CitiBikeNY History log to present 2 unexpected phenomena in a Tableau Dashboard. Metrics included, analytic rigor, readability and visual appeal.
Module 11 Challenge involves web-scraping and data analysis by identifying HTML elements (id/classes attributes) on a page. Extract info with Splinter and HTML parsing using Beautiful Soup.
Module 5 Challenge using Matplotlib, reporting clinical study results. Prepare data, generate summary statistics, create bar/pie/line/scatter/box charts/plots, calculate quartiles, correlation, regression and identify outliers.
Module 12 Challenge uses MongoDB to connect and update database collections on restaurant ratings. Using a Jupyter notebook to perform queries to answer questions about restaurants.
Module 4 Challenge using Pandas and Jupyter Notebook to aggregate trends in school performance to create a report on observable trends based on the data.
A data story on global coffee trade. Dashboard of user-driven visualisations using Plotly. Python Flask API created which includes HTML/CSS, JavaScript and a database using ERD and PostgresSQL.
Module 6 Challenge using Python requests, APIs (OpenWeatherMap & Geoapify) and JSON traversals. Scatter plots and linear regression showcase weather variables and latitude relationships. Mapping hotels within a certain radius of ideal weather conditions.
Module 3 Challenge using Python scripts to analyse financial records and vote-counts from polling data. Import/read/write csv files, tuples, lists, dictionaries, iterating through data and debugging.
Module 9 Challenge involves data modelling, engineering and analysis. Use of QuickDBD to sketch an ERD and table schemas, data type, primary and foreign key relationships. SQL queries for data analysis.
Module 10 Challenge is a climate analysis of a vacation destination. SQLAlchemy ORM queries, Pandas and Matplotlib used for climate data exploration. Precipitation and Stations data converted to json and linked to Flask API.
Module 2 Challenge using VBA scripting to analyse generated stock market data. Looping through data and reading/storing specific values, creating columns, conditional formatting and calculations.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.