Analytics Blog
July 12, 2022
NIGHT MODE
DAY MODE
Who plays well at The Open?
— Using performance trends by golfer nationality to enhance our Open predictions
In the current version of our model, we assess a golfer’s suitability for a particular venue by focusing on their course fit and history. Course fit looks at the relationship between specific golfer skills and performance at the course of interest, while course history focuses on individual player histories. These two adjustments work well when we have many years of data to work off of, but as the amount of data from a given course shrinks, so too does our ability to make meaningful adjustments. In this post we take a broader approach to course-specific adjustments by examining performance trends by country, and then apply that approach to this week's Open Championship.

Most golf fans agree that players of certain nationalities tend to over or under-perform at events hosted in certain countries. There are several plausible reasons this could be true: a home-country effect — Australians playing well at events hosted in Australia, for example; or, the style of play of golfers from a certain country may not suit the style of golf courses in another — the first example that comes to mind is Americans playing poorly in the UK. By estimating nationality-country effects, we can make course-specific adjustments even when there is no historical data at the course of interest. Further, when there is historical data available, as is the case this week at St. Andrews, it’s likely that these country-based adjustments pick up elements of course fit that are missed in the skill-based course fit approach.

In addition to nationalities, we can look at the country-specific performance tendencies of individual golfers. For example, while on average American players haven't performed well in Scotland, Jordan Spieth has over-performed in his 20 rounds there. The tradeoff between using nationalities versus individuals is similar to the tradeoff between course fit and course history: the former increases the sample size, but relies on group characteristics to make individual predictions. Later, when making adjustments for our Open Championship predictions, we'll use the nationality-based estimate as a starting point, or prior, for our prediction of a golfer's performance on Scottish Links courses. Then, for golfers that have substantial Links histories, we move away from this nationality-based prior towards their individual performance records.
The starting point for this analysis is to define a baseline to compare a golfer’s performance to. For that we’ll use our full model which includes course-specific adjustments. Therefore if a given nationality, or golfer, is over-performing relative to their baseline in some country, this over-performance is not due to the usual skill-based course fit considerations that should be captured in our model’s predictions.

We first focus on country-specific performances by golfer nationality; in addition to nationality we group golfers by their primary tour (with just two labels: PGA/KFT, or European Tour). This allows English golfers who play primarily in the US to have different country effects than English golfers playing in Europe. This distinction makes sense if you think performance differences across countries are influenced by the types of courses golfers are currently playing (as opposed to the types of courses they grew up on), or if you think travel impacts performance. We include all tournaments since 2010 on the PGA, European, and Korn Ferry Tours. Shown below are the largest positive and negative nationality-country effects:
Nationality Tour Event Location SG Total Rounds Estimated Skill
RSA EURO RSA 0.34 3045 0.31
SCO EURO SCO 0.32 1028 0.22
RSA PGA/KFT RSA 0.39 590 0.20
ENG EURO NIR 0.36 511 0.19
AUS EURO AUS 0.42 372 0.17
Nationality Tour Event Location SG Total Rounds Estimated Skill
ENG EURO MAS (Malaysia) -0.46 505 -0.21
AUS PGA/KFT MEX -0.45 454 -0.2
USA PGA/KFT SCO -0.29 1214 -0.18
ESP EURO USA -0.42 414 -0.17
SCO EURO CHN -0.43 334 -0.15
The rightmost column displays the 'nationality-country effects', which can be interpreted as our predicted improvement or decline in performance (in strokes per round) for players of that nationality/tour in that country. This estimate comes from the group's overall strokes-gained (relative to baseline) in the country, with more regression towards zero applied for smaller sample sizes. (For those interested in the statistical details, head to the endnote.) It's pretty amazing how well South Africans have played in South Africa; I don't know enough about golf courses there to posit why this would be the case, but with more than 3000 rounds played it's hard to argue against the conclusion.

Next the same calculations are performed for individual golfers; here are the largest player-country effects, after restricting the sample to players with skill levels above that of an average PGA Tour player:
Nationality Player Event Location SG Total Rounds Estimated Skill
RSA Grace, Branden RSA 0.92 112 0.40
RSA Schwartzel, Charl RSA 0.66 122 0.30
FRA Levy, Alexander CHN 1.00 58 0.28
ENG Hatton, Tyrrell SCO 0.89 66 0.27
RSA Aiken, Thomas RSA 0.51 143 0.25
IRL Lowry, Shane ENG 0.7 72 0.23
ENG Willett, Danny RSA 0.81 58 0.23
ENG Fleetwood, Tommy SCO 0.61 82 0.22
ESP Cabrera Bello, Rafa UAE 0.44 138 0.21
USA Kizzire, Patton MEX 1.16 32 0.21
Nationality Player Event Location SG Total Rounds Estimated Skill
THA Jaidee, Thongchai USA -0.97 86 -0.36
ESP Larrazabal, Pablo USA -1.77 34 -0.33
SCO Gallacher, Stephen USA -0.79 68 -0.25
KOR An, Byeong Hun SCO -1.55 28 -0.25
USA Walker, Jimmy SCO -1.79 22 -0.23
ENG Fisher, Ross USA -0.41 154 -0.21
DEN Hansen, Soren USA -2.14 16 -0.21
NED Luiten, Joost SCO -0.67 66 -0.21
SWE Lingmerth, David SCO -2.35 14 -0.2
BEL Colsaerts, Nicolas USA -0.49 105 -0.2
Plenty of interesting histories here. I'm surprised by that Lingmerth number, I thought his small stature would help him in the strong winds that come with Scottish Links. (That was a joke.. just in case.)
Now let's apply this analysis to this week's Open Championship. To start, here is a snapshot from our country performance map that displays the performance by nationality at Open Championships since 2010. (There is a lot of interesting data to explore in the interactive version of this map, so I strongly suggest heading over).


As expected, players from the US underperform substantially at The Open: we estimate that PGA Tour-based Americans perform 0.23 strokes below their baselines. Interestingly, PGA Tour-based Englishmen and Scotsmen perform worse at Open Championships, while their European Tour counterparts perform well above baseline (0.33 strokes for English players — the highest of any country). Australians have not played well at The Open, which might surprise the golf architecture crowd, although this estimate is driven the by play of PGA Tour-based Aussies.

For the Open-specific adjustments that we actually use in our model this week, we look at players' past performance at all Links courses in the UK in addition to Open Championships. All of the plausible explanations for why country effects exist (e.g. style of course, travel) should still hold with this expanded list of courses. To arrive at what we call our "Links Adjustment", we use the nationality-specific estimates as a starting point, and then update with individual player data from that prior. For example, a PGA Tour-based American player who doesn't have a Links history is given a Links adjustment of -0.24 — which is the predicted under-performance on Links courses for that group. For a player like Zach Johnson, who has over-performed in his 38 Links rounds, his Links adjustment will move substantially away from the American prior of -0.24. The table below highlights the Links Adjustment for some selected players this week, along with a breakdown of how we got there.
Player Nationality Tour Links Performance Links Rounds Nationality-based Prior Posterior Skill Experience Bump Final Links Adjustment
Clarke, Darren NIR EURO 1.21 71 0.1 0.46 0.1 0.56
Jordan, Matthew ENG EURO 1.07 27 0.21 0.34 0.054 0.39
Macintyre, Robert SCO EURO 0.47 36 0.23 0.28 0.072 0.35
Hatton, Tyrrell ENG PGA/KFT 0.52 88 0.04 0.22 0.1 0.32
Scott, Adam AUS PGA/KFT 0.25 46 -0.05 0.02 0.092 0.12
Johnson, Zach USA PGA/KFT 0.83 38 -0.18 0.03 0.076 0.11
Burmester, Dean RSA EURO -0.17 56 0.04 -0.02 0.1 0.08
Rose, Justin ENG PGA/KFT -0.27 65 0.04 -0.06 0.1 0.04
Poston, J.T. USA PGA/KFT N/A 0 -0.18 -0.18 0 -0.18
Young, Cameron USA PGA/KFT -5.67 2 -0.18 -0.25 0.004 -0.25
Horschel, Billy USA PGA/KFT -1.23 28 -0.18 -0.34 0.056 -0.29
Darren Clark is receiving the largest Links Adjustment this week at a whopping 0.56 strokes per round! The Northern Ireland baseline (for European Tour-based players) is 0.1, but what's doing most of the work for Clarke is his incredible Links history: 1.21 strokes/round above our model's baseline in 71 rounds. Combining the nationality prior with his history (this is a weighted average, with more weight applied to the individual history the larger is the sample size) gives a skill estimate of 0.46 ("Updated Links Skill" column), and then we tack on 0.1 strokes for Links experience to arrive at the final Links adjustment of 0.56. You'll notice it takes a lot of data to move away from the nationality priors; golf performance is super noisy, so even a 40-round sample is affected to a large degree by randomness. Billy Horschel has the lowest Links adjustment in the field, thanks to his abysmal Links record (-1.23 strokes relative to baseline in 28 rounds). Cameron Young has only 2 Links rounds under his belt, but they didn't go well. Horschel and Young will be looking to buck their personal Links trends this week, while players like Zach Johnson will look to continue defying their American expectations.