andrewgbruce / statistics-for-data-scientists
Code and data associated with the book "Statistics for Data Scientists: 50 Essential Concepts"
I am working through the session-time example in Chapter 3 > Permutation Test > Example: Web Stickiness. I have assumed that the data this example refers to is in the file web_page_data.csv.
For pages A and B, the times in the table appear to be in minutes, because the correct result for mean_b - mean_a (21.4) is obtained only after converting to seconds. If that is right, then Figure 3-3 is not: it appears to show Time as minutes × 100 rather than minutes × 60 (i.e., seconds). Which is correct?
Thanks!
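The size of the discrepancy can be checked with a little arithmetic. The sketch below uses only the 21.4 figure quoted in the question; the variable names are made up for illustration, and it assumes, as the question does, that the file stores minutes.

```python
# Difference in mean session time quoted in the question, taken to be seconds.
diff_seconds = 21.4

# Implied difference in the file's own units, assuming those units are minutes.
diff_minutes = diff_seconds / 60

# The same difference under the scaling that Figure 3-3 appears to use.
diff_times_100 = diff_minutes * 100

print(f"difference in minutes:       {diff_minutes:.3f}")
print(f"difference as minutes * 60:  {diff_minutes * 60:.1f}")
print(f"difference as minutes * 100: {diff_times_100:.1f}")
```

If the figure really is on a minutes × 100 scale, any difference read off it would be 100/60 times the value in seconds.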
Where can I access the code samples and example datasets for the book?
The book uses several datasets in its coding examples. When will they become available here?
The download_data.r script assumes that a /data/ directory already exists, as in ~/statistics-for-data-scientists/data/.
The user could create this directory manually, or a line could be added to the script to create it.
The source code loads a library named ascii, which seems to be a drawing package, but I cannot find anywhere to download it. Why? Where
As recommended in the book, I downloaded the R source files from GitHub. When I run them, I get the following error while the data is being downloaded from Google Drive:
Error in curl::curl_fetch_disk(url, x$path, handle = handle) :
Failed to open file C:\Users\amitk\OneDrive\Documents\statistics-for-data-scientists\data\state.csv.
I tried to find a solution for this error but had no luck. Could you please share the datasets so that I can practise with them?
Where are the ".png" files used in the program located?
What does the following code mean in line 24 of chapter2.r?
```
stat_fun <- function(x, idx) median(x[idx])
boot_obj <- boot(loans_income, R = 1000, statistic=stat_fun)
```
I'm not familiar with R, and I can't find any figure or passage in the book that refers to this code.
I am trying to reproduce it in Python. Could you please give me some advice?
Thanks!
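For readers attempting the same translation, here is a minimal Python sketch of what the two R lines appear to do, using only the standard library. In R's boot package, the `statistic` function receives the data and a vector of resampled indices, and `boot` calls it `R` times on indices drawn with replacement. The `bootstrap` helper and the synthetic `loans_income` values below are assumptions made for illustration; they are not the book's code or data.

```python
import random
import statistics


def stat_fun(x, idx):
    # Same role as the R stat_fun: the median of the resampled values x[idx].
    return statistics.median(x[i] for i in idx)


def bootstrap(x, statistic, R=1000, seed=0):
    """Hypothetical stand-in for boot(): evaluate `statistic` on R resamples."""
    rng = random.Random(seed)
    n = len(x)
    # Statistic on the original sample (indices 0..n-1, each used once).
    original = statistic(x, range(n))
    # Each replicate draws n indices with replacement and recomputes the statistic.
    replicates = [statistic(x, rng.choices(range(n), k=n)) for _ in range(R)]
    return {
        "original": original,
        "bias": statistics.mean(replicates) - original,
        "std_error": statistics.stdev(replicates),
        "replicates": replicates,
    }


# Synthetic data standing in for loans_income (an assumption, not the real file).
data_rng = random.Random(1)
loans_income = [data_rng.lognormvariate(11, 0.5) for _ in range(500)]

result = bootstrap(loans_income, stat_fun, R=1000)
print("original median:", result["original"])
print("bias:", result["bias"])
print("std. error:", result["std_error"])
```

The three printed values correspond to the original statistic, bias, and standard error that a boot object reports when printed in R. Passing indices rather than resampled data mirrors the R interface, so other statistics can be swapped in by changing only `stat_fun`.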
In prep_datasets.r, you load the data files "state_populations.csv" and "murder_rate.csv":
state_pop <- read.csv("/Users/andrewbruce1/book/state_populations.csv")
murder_rate <- read.csv("/Users/andrewbruce1/book/murder_rate.csv")
However, download_data.r does not download either "state_populations.csv" or "murder_rate.csv".