Cover Coming Soon

The Shape of Data

Network Science, Geometry-Based Machine Learning, and Topological Data Analysis in R
by Colleen M. Farrelly and Yaé Ulrich Gaba
June 2023, 272 pp.
Use coupon code PREORDER to get 25% off!

The Shape of Data shows how to use geometry- and topology-based algorithms for machine learning. Focused on practical applications rather than dense mathematical concepts, the book progresses through coding examples using social network data, text data, medical data, and education data. Readers will come away with an entirely new toolkit to use in their own machine-learning work, as well as with a solid understanding of some of the most exciting algorithms being used in the field today.

Author Bio 

Colleen M. Farrelly is a senior data scientist whose academic and industry research has focused on topological data analysis, quantum machine learning, geometry-based machine learning, network science, hierarchical modeling, and natural language processing. Since graduating from University of Miami with an MS in Biostatistics, Colleen has worked as a data scientist in a variety of industries, including health care, consumer packaged goods, biotech, nuclear engineering, marketing, and education. Colleen often speaks at tech conferences, including PyData, SAS Global, WiDS, Data Science Africa, and DataScience SALON. When not working, Colleen can be found writing haibun/haiga or doing any sort of water sport.

Yaé Ulrich Gaba completed his doctoral studies at the University of Cape Town (UCT, South Africa) with specialization in Topology and is presently a research associate at Quantum Leap Africa (QLA, Rwanda). His research interests are computational geometry, applied algebraic topology (topological data analysis), and geometric machine learning (graph and point-cloud representation learning). His current focus lies in geometric methods in data analysis, and his work seeks to develop effective and theoretically justified algorithms for data/shape analysis using geometric and topological ideas and methods.

Table of contents 

1. Why Geometry?
2. Introduction to Network Data
3. Network Analysis
4. Beyond Networks
5. Geometry in Data Science
6. Other Applications of Geometry in Machine Learning
7. Topological Data Analysis
8. Algorithms Related to Homotopy
9. Working with Language
10. Computational Solutions for TDA Algorithms

The chapters in red are included in this Early Access PDF.