Data Wrangling with Python (Python数据处理, Reprint Edition, in English)

By Jacqueline Kazil and Katharine Jarmul (USA)




Posted on 2024-04-28

Book Information

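Publisher: Southeast University Press
ISBN: 9787564170035
Edition: 1
Product code: 12052873
Binding: Paperback
Format: 16mo
Publication date: 2017-02-01
Paper: Offset paper
Pages: 488
Word count: 622,000
Text language: English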


Book Description

Summary

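Data Wrangling with Python (Reprint Edition, in English) shows non-programmers how to process information that is inherently too messy or difficult to work with. You do not need to know the basics of the Python programming language to get started.
Through step-by-step exercises, you will learn how to acquire, clean, analyze, and present data efficiently. You will also learn how to automate your data processing, schedule file-editing and cleanup tasks, handle large datasets, and tell compelling stories with the data you obtain.
Quickly learn basic Python syntax, data types, and language concepts
Work with both machine-readable and human-consumable data
Scrape websites and APIs to find a wealth of useful information
Clean and format data to eliminate duplicates and errors in your datasets
Learn when to standardize data and when to test and script data cleanup
Explore and analyze your datasets with new Python libraries and techniques
Use Python solutions to automate your entire data-wrangling process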

Table of Contents

Preface
1. Introduction to Python
Why Python
Getting Started with Python
Which Python Version
Setting Up Python on Your Machine
Test Driving Python
Install pip
Install a Code Editor
Optional: Install IPython
Summary

2. Python Basics
Basic Data Types
Strings
Integers and Floats
Data Containers
Variables
Lists
Dictionaries
What Can the Various Data Types Do?
String Methods: Things Strings Can Do
Numerical Methods: Things Numbers Can Do
List Methods: Things Lists Can Do
Dictionary Methods: Things Dictionaries Can Do
Helpful Tools: type, dir, and help
type
dir
help
Putting It All Together
What Does It All Mean?
Summary

3. Data Meant to Be Read by Machines
CSV Data
How to Import CSV Data
Saving the Code to a File; Running from Command Line
JSON Data
How to Import JSON Data
XML Data
How to Import XML Data
Summary

4. Working with Excel Files
Installing Python Packages
Parsing Excel Files
Getting Started with Parsing
Summary

5. PDFs and Problem Solving in Python
Avoid Using PDFs!
Programmatic Approaches to PDF Parsing
Opening and Reading Using slate
Converting PDF to Text
Parsing PDFs Using pdfminer
Learning How to Solve Problems
Exercise: Use Table Extraction, Try a Different Library
Exercise: Clean the Data Manually
Exercise: Try Another Tool
Uncommon File Types
Summary

6. Acquiring and Storing Data
Not All Data Is Created Equal
Fact Checking
Readability, Cleanliness, and Longevity
Where to Find Data
Using a Telephone
US Government Data
Government and Civic Open Data Worldwide
Organization and Non-Government Organization (NGO) Data
Education and University Data
Medical and Scientific Data
Crowdsourced Data and APIs
Case Studies: Example Data Investigation
Ebola Crisis
Train Safety
Football Salaries
Child Labor
Storing Your Data: When, Why, and How?
Databases: A Brief Introduction
Relational Databases: MySQL and PostgreSQL
Non-Relational Databases: NoSQL
Setting Up Your Local Database with Python
When to Use a Simple File
Cloud-Storage and Python
Local Storage and Python
Alternative Data Storage
Summary

7. Data Cleanup: Investigation, Matching, and Formatting
Why Clean Data?
Data Cleanup Basics
Identifying Values for Data Cleanup
Formatting Data
Finding Outliers and Bad Data
Finding Duplicates
Fuzzy Matching
RegEx Matching
What to Do with Duplicate Records
Summary

8. Data Cleanup: Standardizing and Scripting
Normalizing and Standardizing Your Data
Saving Your Data
Determining What Data Cleanup Is Right for Your Project
Scripting Your Cleanup
Testing with New Data
Summary

9. Data Exploration and Analysis
Exploring Your Data
Importing Data
Exploring Table Functions
Joining Numerous Datasets
Identifying Correlations
Identifying Outliers
Creating Groupings
Further Exploration
Analyzing Your Data
Separating and Focusing Your Data
What Is Your Data Saying?
Drawing Conclusions
Documenting Your Conclusions
Summary

10. Presenting Your Data
Avoiding Storytelling Pitfalls
How Will You Tell the Story?
Know Your Audience
Visualizing Your Data
Charts
Time-Related Data
Maps
Interactives
Words
Images, Video, and Illustrations
Presentation Tools
Publishing Your Data
Using Available Sites
Open Source Platforms: Starting a New Site
Jupyter (Formerly Known as IPython Notebooks)
Summary

11. Web Scraping: Acquiring and Storing Data from the Web
What to Scrape and How
Analyzing a Web Page
Inspection: Markup Structure
Network/Timeline: How the Page Loads
Console: Interacting with JavaScript
In-Depth Analysis of a Page
Getting Pages: How to Request on the Internet
Reading a Web Page with Beautiful Soup
Reading a Web Page with LXML
A Case for XPath
Summary

12. Advanced Web Scraping: Screen Scrapers and Spiders
Browser-Based Parsing
Screen Reading with Selenium
Screen Reading with Ghost.Py
Spidering the Web
Building a Spider with Scrapy
Crawling Whole Websites with Scrapy
Networks: How the Internet Works and Why It's Breaking Your Script
The Changing Web (or Why Your Script Broke)
A (Few) Word(s) of Caution
Summary

13. APIs
API Features
REST Versus Streaming APIs
Rate Limits
Tiered Data Volumes
API Keys and Tokens
A Simple Data Pull from Twitter's REST API
Advanced Data Collection from Twitter's REST API
Advanced Data Collection from Twitter's Streaming API
Summary

14. Automation and Scaling
Why Automate?
Steps to Automate
What Could Go Wrong?
Where to Automate
Special Tools for Automation
Using Local Files, argv, and Config Files
Using the Cloud for Data Processing
Using Parallel Processing
Using Distributed Processing
Simple Automation
CronJobs
Web Interfaces
Jupyter Notebooks
Large-Scale Automation
Celery: Queue-Based Automation
Ansible: Operations Automation
Monitoring Your Automation
Python Logging
Adding Automated Messaging
Uploading and Other Reporting
Logging and Monitoring as a Service
No System Is Foolproof
Summary

15. Conclusion
Duties of a Data Wrangler
Beyond Data Wrangling
Become a Better Data Analyst
Become a Better Developer
Become a Better Visual Storyteller
Become a Better Systems Architect
Where Do You Go from Here?
A. Comparison of Languages Mentioned
B. Python Resources for Beginners
C. Learning the Command Line
D. Advanced Python Setup
E. Python Gotchas
F. IPython Hints
G. Using Amazon Web Services
Index
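User Reviews

The book is well made; it covers commonly used algorithms and their implementations.

I know you want to run sentiment analysis on our user review text, so I'm deliberately writing an outlier. Let's see whether your algorithm can catch it.

Not a beginner's book; it suits readers who already have a Python foundation.

Very good, worth five stars!

Bought a whole lot of books in one go and feel great about it. I pooled the order with my roommate; this will keep us reading for a long time.

Pretty good. I bought a pile of books at once, enough to read for a year.

Explains the overall sklearn framework fairly clearly; a good book for getting started with sklearn and machine learning.

Very good, packed with solid content, and the promotional price was a good deal. Thumbs up for JD.com.

The book looks decent and does what it says. Work at it and you'll make a breakthrough.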

© 2024 book.qciss.net (图书大百科). All rights reserved.