麵嚮數據科學傢的實用統計學(影印版) [1. Practical Statistics for Data Scientists] pdf epub mobi txt 電子書 下載
內容簡介
很多數據科學資源包括瞭統計方法,但是欠缺具有深度的統計學視角。如果你熟悉R語言編程,也對統計學有所瞭解,這份快速參考將幫助你搭建易學可達的知識橋梁。
你將從這本書中學到:
? 為什麼探究式數據分析是數據科學的入門關鍵
? 隨機采樣如何減少偏見並産生高質量的數據集,即便用於大數據
? 實驗設計原則如何生成針對問題的答案
? 如何使用迴歸估計結果及檢測異常
? 用於預測記錄歸屬的關鍵歸類技巧
? 從數據學習到的統計機器學習方法
? 用於從未標記數據中提取意義的無監督學習方法
作者簡介
Peter Bruce 創立並發展壯大瞭Statistics.com上的統計學教育學院,該學院目前提供約90項統計學課程,近半數麵嚮數據科學傢。
Andrew Bruce 在學術、政府和商業各領域擁有超過30年的統計學和數據科學經驗,作為美國華盛頓大學統計學博士,他在同行評審的期刊上發錶過多篇論文。
精彩書評
“本書既不是另一部統計學教材,也不是機器學習手冊。它是更好的:運用清晰的解釋和豐富的實例,在實用統計學術語、原則和當下數據挖掘行話與實踐之間建立聯係。這是一本對於數據科學初學者和老手們而言都很棒的參考書。”
——Galit Shmueli(暢銷圖書《Data Mining for Business Analytics》係列主要作者,中國颱灣清華大學著名教授)
目錄
Preface
1. Exploratory Data Analysis
Elements of Structured Data
Further Reading
Rectangular Data
Data Frames and Indexes
Nonrectangular Data Structures
Further Reading
Estimates of Location
Mean
Median and Robust Estimates
Example: Location Estimates of Population and Murder Rates
Further Reading
Estimates of Variability
Standard Deviation and Related Estimates
Estimates Based on Percentiles
Example: Variability Estimates of State Population
Further Reading
Exploring the Data Distribution
Percentiles and Boxplots
Frequency Table and Histograms
Density Estimates
Further Reading
Exploring Binary and Categorical Data
Mode
Expected Value
Further Reading
Correlation
Scatterplots
Further Reading
Exploring Two or More Variables
Hexagonal Binning and Contours (Plotting Numeric versus Numeric Data)
Two Categorical Variables
Categorical and Numeric Data
Visualizing Multiple Variables
Further Reading
Summary
2. Data and Sampling Distributions
Random Sampling and Sample Bias
Bias
Random Selection
Size versus Quality: When Does Size Matter?
Sample Mean versus Population Mean
Further Reading
Selection Bias
Regression to the Mean
Further Reading
Sampling Distribution of a Statistic
Central Limit Theorem
Standard Error
Further Reading
The Bootstrap
Resampling versus Bootstrapping
Further Reading
Confidence Intervals
Further Reading
Normal Distribution
Standard Normal and QQ-Plots
Long-Tailed Distributions
Further Reading
Student's t-Distribution
Further Reading
Binomial Distribution
Further Reading
Poisson and Related Distributions
Poisson Distributions
Exponential Distribution
Estimating the Failure Rate
……
3. Statistical Experiments and Significance Testing
4. Regression and Prediction
5. Classification
6. Statistical Machine Learning
7. Unsupervised Learning
Bibliography
Index
麵嚮數據科學傢的實用統計學(影印版) [1. Practical Statistics for Data Scientists] 下載 mobi epub pdf txt 電子書
麵嚮數據科學傢的實用統計學(影印版) [1. Practical Statistics for Data Scientists] pdf epub mobi txt 電子書 下載