Uncategorized – Beat the Bookie

Jan 6, 2024Jan 9, 2024

Comparing the predictive power of different xG data providers

In the realm of sports betting, predictive analytics hinges on quality data, a challenge given the cost associated with many paid services. This article delves into the world of free xG (Expected Goals) data providers used by the BeatTheBookie services, assessing their predictive power for football betting, a critical aspect for enthusiasts who seek to enhance their strategies without breaking the bank.

Dec 25, 2022

Vanilla Poisson performance incl. minor leagues

As already done for the ZIP Poisson model, I also added some smaller leagues data to my Vanilla Poisson model: Championship, Seria B, La Liga 2, Eredivise, Liga Portugal. All these additional leagues are already available through my data service. So it’s time to take a look how profitable these new leagues are using the Vanilly Poisson model.

Aug 22, 2022Jun 6, 2023

Inflated ML Poisson model to predict football matches

My last blog post “Poisson vs Reality” did change something in my head. I realized, that I not yet checked single parts of my model enough, whether they differ from reality and whether I could reduce this difference and improve the model performance. That’s why I started creating a new model approach for the new season and focus on the improvement of single steps during the model process. After the training of multiple models, I will test against the fair profit, which kind of adaptions improve a Poisson distribution model the most.

Feb 5, 2022

Using xG & advanced stats to predict football matches

With the BeatTheBookieDataService in place it’s also time to provide some new models. This post will take a look at possible models using the team statistics provided for each match by understat.com. Therefor I will compare 3 of the most used machine learning algorithms. Beside this, it’s also time to test again some basics for predictiv modeling for football: “To differ between home/away performance or not to differ”? For my Poisson models I always differed between home and away performance. But is this also needed, when using ML algorithms?

Mar 12, 2021Mar 12, 2021

Running Exasol on AWS

Automating data pipelines in AWS was just the first step of moving my betting models into the cloud. Nearly all my calculations were done in a Exasol database and I also want to keep them in a database. So I need to host one in my AWS account. For such use-cases AWS offers virtual EC2 instances. This blog will explain the single steps how to install an Exasol DB in AWS.

Oct 25, 2020Dec 14, 2020

Why every data scientist should learn SQL

It’s been quite a long time since my last post for my blog. But that has been because of a specific reason: I participated at the 2nd DFB Hackathon, which consumed a huge amount of my freetime, which I normally spent creating some content for my blog. The Hackathon was again a great experience as all this deep data science stuff is still a challenge for me. But there’s again on big question on my side: Why are data scientist often just using Python (or R) and don’t know, how and when to use SQL.