<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Machine Learning on David Bartolomei-Guzmán</title><link>https://www.davidbartolomei.com/categories/machine-learning/</link><description>Recent content in Machine Learning on David Bartolomei-Guzmán</description><generator>Hugo</generator><language>en-us</language><lastBuildDate>Sun, 05 Apr 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://www.davidbartolomei.com/categories/machine-learning/feed.xml" rel="self" type="application/rss+xml"/><item><title>Migrating from RunPod to Local Whisper Inference with MLX and a DGX Spark</title><link>https://www.davidbartolomei.com/migrating-from-runpod-to-local-whisper-inference/</link><pubDate>Sun, 05 Apr 2026 00:00:00 +0000</pubDate><guid>https://www.davidbartolomei.com/migrating-from-runpod-to-local-whisper-inference/</guid><description>&lt;p&gt;In my &lt;a href="https://www.davidbartolomei.com/case-study-leveraging-machine-learning-for-spoken-media-analysis-share-of-voice-of-puerto-ricos-political-figures-in-2024/"&gt;previous article&lt;/a&gt;, I described how we use OpenAI&amp;rsquo;s Whisper model to transcribe radio and TV broadcasts for Monitorea, our media monitoring platform. At the time, we were running inference on RunPod - a serverless GPU platform that lets you deploy ML models without managing hardware. It was the right call to get started quickly. But as we scaled, the economics stopped making sense.&lt;/p&gt;
&lt;p&gt;Here&amp;rsquo;s how we migrated to fully local inference in about a weekend, using MLX on Apple Silicon and a DGX Spark we call Sparky.&lt;/p&gt;</description></item><item><title>Launching Monitorea: AI Agents for Broadcast Media Intelligence</title><link>https://www.davidbartolomei.com/launching-monitorea-ai-agents-for-broadcast-media-intelligence/</link><pubDate>Wed, 01 Oct 2025 00:00:00 +0000</pubDate><guid>https://www.davidbartolomei.com/launching-monitorea-ai-agents-for-broadcast-media-intelligence/</guid><description>&lt;p&gt;A year ago I published a &lt;a href="https://www.davidbartolomei.com/case-study-leveraging-machine-learning-for-spoken-media-analysis-share-of-voice-of-puerto-ricos-political-figures-in-2024/"&gt;case study&lt;/a&gt; analyzing Share of Voice across Puerto Rico&amp;rsquo;s AM radio stations using Whisper transcriptions. The article ended with a long list of &amp;ldquo;future work&amp;rdquo; — fine-tuning, entity recognition, segment classification, summarization. At the time, those were ideas I wanted to explore. As of this month, most of them are running in production.&lt;/p&gt;
&lt;p&gt;&lt;a href="https://monitorea.ai"&gt;Monitorea&lt;/a&gt; is now in private beta. Here&amp;rsquo;s what changed and how we got here.&lt;/p&gt;</description></item><item><title>Case Study: Leveraging Machine Learning for Spoken Media Analysis – Share of Voice of Puerto Rico’s Political Figures in 2024</title><link>https://www.davidbartolomei.com/case-study-leveraging-machine-learning-for-spoken-media-analysis-share-of-voice-of-puerto-ricos-political-figures-in-2024/</link><pubDate>Tue, 01 Oct 2024 00:00:00 +0000</pubDate><guid>https://www.davidbartolomei.com/case-study-leveraging-machine-learning-for-spoken-media-analysis-share-of-voice-of-puerto-ricos-political-figures-in-2024/</guid><description>&lt;p&gt;This is the first in a series of articles where I share my findings exploring Speech-to-Text (STT) ML models to transcribe and analyze spoken content in news media. In this article, I discuss how STT output can be used for automatic mention detection and tracking metrics such as Share of Voice of political figures in Puerto Rico during the 2024 election season.&lt;/p&gt;
&lt;h2 id="the-back-story"&gt;The Back Story&lt;/h2&gt;
&lt;p&gt;Before diving into the details, here’s a brief back story on what sparked my interest in this topic. You can skip directly to the results by scrolling down.&lt;/p&gt;</description></item></channel></rss>