What Programming Languages Are Used Most on Weekends?
An analysis using StackLite, a Kaggle dataset of Stack Overflow questions and tags
Machine learning, text analysis, and more
An analysis using StackLite, a Kaggle dataset of Stack Overflow questions and tags
An analysis of last year’s survey and a Shiny app
I spoke on approaching text mining tasks using tidy data principles at rstudio::conf yesterday. I was so happy to have the opportunity to speak, and the conference has been a great experience. If you want to catch up on what has been going on at rstudio::conf, Karl Broman put together a GitHub repo of slides, and Sharon Machlis has been live-blogging the conference at Computerworld. A highlight for me was Andrew Flowers' talk on data journalism and storytelling; I don’t work in data journalism, but I think I can apply almost everything he said to my own work.
Text mining of one day’s submissions on Reddit
Readability in text using tidy data principles