We don’t know about you, but at the end of this project we find ourselves rejuvenated, empowered, and somewhat exhausted. In writing ten weeks of daily blog entries, we have learned a tremendous amount in terms of technical skills, pedagogical ideas to try in our fall courses, and ways to connect to the amazing & large data science community.
One of our main goals we set for ourselves this summer was to create a roadmap for faculty development to “ease the learning curve and help busy people incorporate new tools and approaches into their teaching.” As each individual blog reader is in a different spot on the learning curve, we hope that our entries have been useful for both those who are just starting their journeys and those with much more expertise than we have. As a way of summarizing, we’ve provided an organization of the entire list of blog entries. Keep in mind that the blog also has tags for each entry, so feel free to continue to explore the blog in whatever ways are helpful for you.
What topics did the blog cover?
- data science software
- data ingestation
- data technologies
- Leaflet for mapping
- infer
- Less Volume More Creativity in R
- Style Guides for Coding
- Creating Tutorials and Lessons in R using learnr
- Five Quick Tips for Coding in the Classroom
- Teaching refactoring to improve code
- Using CODAP to teach data science
- reprex: Help me help you
- Parallel processing and sparklyr
- reticulate: running Python within RStudio
- Writing R data packages
- testthat
- Counting commits and code review
- One model to rule them all
- Cloud computing redux
- data wrangling
- visualization and exploration
- communication
- Watching an expert work through a data analysis using the tidyverse
- Not So Standard Deviations: not your average data science podcast
- Diversity in Data Science & Statistics
- Practical Data Science: an introduction to the PeerJ collection
- Pair Programming for Data Science and Statistics
- Data Science for Good
- The Conversation Around p-values
- useR!
- data8
- data100
- The Python Community
- Algorithmic Bias
- key reports and findings
Who are we?
Hunter Glanz (twitter) is an Assistant Professor of Statistics and Data Science at California Polytechnic State University (Cal Poly, San Luis Obispo). He received a BS in Mathematics and a BS in Statistics from Cal Poly, San Luis Obispo followed by an MA and PhD in Statistics from Boston University. He maintains a passion for machine learning and statistical computing, and enjoys advancing education efforts in these areas. In particular, Cal Poly’s courses in R, SAS, and Python give him the opportunity to connect students with exciting data science topics amidst a firm grounding in communication of statistical ideas. Hunter serves on numerous committees and organizations dedicated to delivering cutting edge statistical and data science content to students and professionals alike. In particular, the ASA’s DataFest event at UCLA has been an extremely rewarding experience for the teams of Cal Poly students Hunter has had the pleasure of advising.
Jo Hardin (twitter) is a statistician at Pomona College who is passionate about statistics and data science education for all. She received her BA from Pomona College and her MS and PhD degrees from the University of California, Davis. Two years of working at the Fred Hutchinson Cancer Research Center in Seattle, WA got her hooked into analyzing high throughput data (e.g., simultaneous gene expression of thousands of genes). Much of her theoretical work has focused on computational approaches to statistical problems in genetics. Jo works hard to provide her students with current best practices, including teaching the tidyverse
to her students in Introduction to Statistics. Beyond the classroom, she has also worked to engage students in statistics and data science: she has worked on the ASA’s curriculum guidelines task force, sent groups of students to UCLA’s DataFest competition, and hosted a local StatFest conference. Through her endeavors, she is working to help undergraduates learn about analyzing data in the wild and gain the skills to be effective and ethical when tasked with making claims from data. This fall she will be teaching statistics inside the California Rehabilitation Center through the international Inside-Out Prison Exchange Program.
Nick Horton (twitter) is Beitzel Professor of Technology and Society and Professor of Statistics and Data Science at Amherst College. His recent work has focused on statistics and data science education. Nick is a fellow of the American Statistical Association and the American Association for the Advancement of Science. He chaired the Committee of Presidents of Statistical Societies and the ASA Curriculum Guidelines for Undergraduate Programs in Statistical Science workgroup. Nick serves on the National Academies Committee for Applied and Theoretical Statistics and is a co-author of the 2018 “Undergraduate Data Science: Opportunities and Options” consensus study report and the ASA’s revised GAISE (Guidelines for Assessment and Instruction in Statistics Education) College report.