Data Science at Scale with Python and Dask

Data Science at Scale with Python and Dask - Jesse C Daniel

Data Science at Scale with Python and Dask

Large datasets tend to be distributed, non-uniform, and prone to change. Dask simplifies the process of ingesting, filtering, and transforming data, reducing or eliminating the need for a heavyweight framework like Spark.



Data Science at Scale with Python and Dask teaches readers how to build distributed data projects that can handle huge amounts of data. The book introduces Dask Data Frames and teaches helpful code patterns to streamline the reader's analysis.



Key Features



Working with large structured datasets
Writing DataFrames
Cleaningand visualizing DataFrames
Machine learning with Dask-ML
Working with Bags and Arrays



Written for data engineers and scientists with experience using Python. Knowledge of the PyData stack (Pandas, NumPy, and Scikit-learn) will be helpful. No experience with low-level parallelism is required.



About the technology

Dask is a self-contained, easily extendible library designed to query, stream, filter, and consolidate huge datasets.



Jesse Daniel has five years of experience writing applications in Python, including three years working with in the PyData stack (Pandas, NumPy, SciPy, Scikit-Learn). Jesse joined the faculty of the University of Denver in 2016 as an adjunct professor of business information and analytics, where he currently teaches a Python for Data Science course.
Citeste mai mult

-10%

transport gratuit

PRP: 339.92 Lei

!

Acesta este Pretul Recomandat de Producator. Pretul de vanzare al produsului este afisat mai jos.

305.93Lei

305.93Lei

339.92 Lei

Primesti 305 puncte

Important icon msg

Primesti puncte de fidelitate dupa fiecare comanda! 100 puncte de fidelitate reprezinta 1 leu. Foloseste-le la viitoarele achizitii!

Indisponibil

Plaseaza rapid comanda

Important icon msg

Completeaza mai jos numarul tau de telefon

Poti comanda acest produs introducand numarul tau de telefon. Vei fi apelat de un operator Libris.ro in cele mai scurt timp pentru prealuarea datelor necesare.

Descrierea produsului

Large datasets tend to be distributed, non-uniform, and prone to change. Dask simplifies the process of ingesting, filtering, and transforming data, reducing or eliminating the need for a heavyweight framework like Spark.



Data Science at Scale with Python and Dask teaches readers how to build distributed data projects that can handle huge amounts of data. The book introduces Dask Data Frames and teaches helpful code patterns to streamline the reader's analysis.



Key Features



Working with large structured datasets
Writing DataFrames
Cleaningand visualizing DataFrames
Machine learning with Dask-ML
Working with Bags and Arrays



Written for data engineers and scientists with experience using Python. Knowledge of the PyData stack (Pandas, NumPy, and Scikit-learn) will be helpful. No experience with low-level parallelism is required.



About the technology

Dask is a self-contained, easily extendible library designed to query, stream, filter, and consolidate huge datasets.



Jesse Daniel has five years of experience writing applications in Python, including three years working with in the PyData stack (Pandas, NumPy, SciPy, Scikit-Learn). Jesse joined the faculty of the University of Denver in 2016 as an adjunct professor of business information and analytics, where he currently teaches a Python for Data Science course.
Citeste mai mult

De pe acelasi raft

Parerea ta e inspiratie pentru comunitatea Libris!

Noi suntem despre carti, si la fel este si

Newsletter-ul nostru.

Aboneaza-te la vestile literare si primesti un cupon de -10% pentru viitoarea ta comanda!

*Reducerea aplicata prin cupon nu se cumuleaza, ci se aplica reducerea cea mai mare.

Ma abonez image one
Ma abonez image one