Data Science Corner

Lecturely droplets of Data Science interesting stuff…

DateDescriptionLink
5/14 Keras (Python Deep Learning framework) https://keras.io/
5/14 Keras in R https://keras.rstudio.com/
5/6 Computational Journalism https://engineering.stanford.edu/magazine/article/what-future-computational-journalism
5/4 Shiny: interactive data apps with R http://shiny.rstudio.com/
5/4 D3.js: interactive data visualization for JavaScript https://d3js.org/
4/29 A commentary on the Santa Clara study I alluded to (from Andrew Gelman, his blog is quite the resource, highly recommended) https://statmodeling.stat.columbia.edu/2020/04/19/fatal-flaws-in-stanford-study-of-coronavirus-prevalence/
4/27 a16z: Another podcast I like (for those with an entrepenurial bent) https://a16z.com/2020/04/26/journal-club-machine-learning-antibiotics-coronavirus-protein-structures/
4/22 From the StitchFix Data Science Blog https://multithreaded.stitchfix.com/blog/2019/03/11/FullStackDS-Generalists/
4/15 More on gradient descent https://towardsdatascience.com/understanding-the-mathematics-behind-gradient-descent-dde5dc9be06e
4/8 Hacking pdfs for mobility data https://www.nxn.se/valent/2020/4/7/converting-images-of-line-graphs-to-data
4/6 Kaggle COVID-19 Open Research Dataset Challenge https://www.kaggle.com/allen-institute-for-ai/CORD-19-research-challenge
4/1 Visual explanation of stats and probability http://students.brown.edu/seeing-theory/
3/11 COVID-19 Data https://github.com/CSSEGISandData/COVID-19
3/9 Uber Engineering Blogpost: platform in 2019 https://eng.uber.com/uber-data-platform-2019/
3/9 Uber Engineering Blogpost on their Data Science Workbench https://eng.uber.com/dsw/
3/4 NASA's Helio Hackweek https://heliohackweek.github.io/
2/24 Data Science is becoming Software Engineering https://towardsdatascience.com/data-science-is-becoming-software-engineering-53e31314939a
2/19 One take on Data Science Survey https://www.theverge.com/2017/11/1/16589246/machine-learning-data-science-dirty-data-kaggle-survey-2017
2/17 Example wrangling with R and Python https://www.superdatascience.com/wrangling-data-r-python/
2/12 The Julia programming language https://julialang.org/
2/10 Best practices for computational science https://openresearchsoftware.metajnl.com/articles/10.5334/jors.ay/
2/10 Good enough practices in scientific computing http://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1005510
2/7 RStudio conference just finished last week https://resources.rstudio.com/rstudio-conf-2019
1/29 Data Skeptic: Podcast and more https://dataskeptic.com/
1/27 Enterprise Data Analysis and Visualization: An Interview Study. Sean Kandel, Andreas Paepcke, Joseph Hellerstein, Jeffrey Heer https://idl.cs.washington.edu/papers/enterprise-analysis-interviews/
1/27 David Donoho (Stanford Prof. of Statistics): 50 years of Data Science https://www.tandfonline.com/doi/full/10.1080/10618600.2017.1384734