Evaluate multiple modeling approaches for #TidyTuesday spam email

Use workflowsets to evaluate multiple possible models to predict whether email is spam.

Classification metrics for #TidyTuesday GPT detectors

Learn about different kinds of metrics for evaluating classification models, and how to compute, compare, and visualize them.

What tokens are used more vs. less in #TidyTuesday place names?

Let’s use byte pair encoding tokenization along with Poisson regression to understand which tokens are more more often (or less often) in US place names.

Predict the magnitude of #TidyTuesday tornadoes with effect encoding and xgboost

How well can we predict the magnitude of tornadoes in the US? Let’s use xgboost along with effect encoding to fit our model.

Tune an xgboost model with early stopping and #TidyTuesday childcare costs

Can we predict childcare costs in the US using an xgboost model? In this blog post, learn how to use early stopping for hyperparameter tuning.

Blog