Predicting Best Picture at the 2026 Academy Awards

January 12, 2026

March 9th, 2026

This is definitely the most interesting year that I’ve done this forecast. The SAG, ACE, and WGA awards tightened the race in the final weeks to now the prediction is pretty simple. There’s about a 50% chance that One Battle After Another wins. There’s about a 33% chance Sinners wins. Which leaves about a 17% chance that any of the other eight movies win Best Picture. Putting it another way: You have a side-sided die. If you roll a 1, 2, or 3, then OBAA wins. If you roll a 4 or 5, Sinners wins. If you roll a 6, then one of the other films wins. When you put it that way, things feel wide open.

And so, my official prediction would be for One Battle After Another to win—while noting that there is a whole lot of uncertainty going into this weekend.

R and Python Together: Refactoring and Prompt Engineering A Previous Case Study, Using the Perplexity API

December 30, 2024

I wrote a post last year looking at how to employ tools in LangChain to have GPT-3.5 Turbo access information on the web, outside of its training data.

The purpose of the present post is to revisit this post, improving the poor performance I saw there through refactoring and prompt engineering.

Background

The motivating example is again using large language models (LLMs) to help me calculate features for my Oscar model. Specifically: How many films did the director of a Best Picture nominee direct before the nominated film?

Rethinking How I Do Supervised Topic Modeling, Using ModernBERT and GPT-4o mini

December 21, 2024

I wrote a post in July 2023 describing my process for building a supervised text classification pipeline. In short, the process first involves reading the text, writing a thematic content coding guide, and having humans label text. Then, I define a variety of ways to pre-process text (e.g., word vs. word-and-bigram tokenizing, stemming vs. not, stop words vs. not, filtering on the number of times a word had to appear in the corpus) in a workflowset. Then, I run these different pre-processors through different standard models: elastic net, XGBoost, random forest, etc. Each class of text has its own model, so I would run this pipeline five times if there were five topics in the text. Importantly, this is not natural language processing (NLP), as it was a bag-of-words approach.

The idea was to leverage the domain knowledge of the experts on my team through content coding, and then scaling it up using a machine learning pipeline. In the post, I bemoaned how most of the “NLP” or “AI-driven” tools I had tested did not do very well. The tools I was thinking of were all web-based, point-and-click applications that I had tried out since about 2018, and they usually were unsupervised.

We are in a wildly different environment now when it comes to analyzing text than we were even a few years ago. I am revisiting that post to explore alternate routes to classifying text. I will use the same data as I did in that post: 720 Letterboxd reviews of Wes Anderson’s film Asteroid City. There is only one code: Did the review discuss Wes Anderson’s unique visual style (1) or not (0)? I hand-labeled all of these on one afternoon to give me a supervised dataset to play with.

Blog

Archive