Software News

Breaking the GPU Bottleneck: How Distributed Computing is Expanding AI Training

May 12, 2026

Training complex AI models typically requires exclusive access to massive, centralized supercomputers. This rigid requirement has created a “compute divide” in the scientific community, locking many researchers out of cutting-edge machine learning research.

To combat this, a collaborative research team at UW–Madison and the Morgridge Institute for Research is leveraging the NAIRR Pilot to prove that AI training doesn’t have to be centralized. The team used distributed High Throughput Computing (dHTC) to break massive AI training tasks down into “small bites”: short, independent units of work that can be checkpointed and rescheduled wherever capacity appears.
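
To make the “small bites” idea concrete, here is a minimal sketch of a checkpointed unit of work in Python. Everything in it, including the file names, the step count, and the stand-in training function, is a hypothetical illustration rather than the team’s actual code; the point is only that each bite saves its progress and exits, so it can be killed and restarted elsewhere without losing work.

    import os
    import pickle

    CHECKPOINT = "train_state.pkl"   # hypothetical checkpoint file
    STEPS_PER_BITE = 500             # kept small so one bite fits in a brief GPU slot

    def load_state():
        """Resume from the last checkpoint, or start fresh."""
        if os.path.exists(CHECKPOINT):
            with open(CHECKPOINT, "rb") as f:
                return pickle.load(f)
        return {"step": 0}  # placeholder initial training state

    def train_one_step(state):
        """Stand-in for a real optimizer step (model code omitted)."""
        state["step"] += 1
        return state

    def run_one_bite():
        """Do one small bite of work, checkpoint it, and exit.

        Saving progress after every bite means the job can be preempted
        and rescheduled on another machine without losing work.
        """
        state = load_state()
        for _ in range(STEPS_PER_BITE):
            state = train_one_step(state)
        with open(CHECKPOINT, "wb") as f:
            pickle.dump(state, f)
        print(f"checkpointed at step {state['step']}")

    if __name__ == "__main__":
        run_one_bite()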

Using software tools like HTCondor, they distributed these fragmented workloads across a nationwide network of computing providers. By harnessing small, opportunistically scheduled pockets of available GPU time across 13 different sites, the team trained highly complex models with no degradation in quality.
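
For a flavor of how such fragmented workloads are handed to HTCondor, the sketch below uses HTCondor’s Python bindings to queue many independent GPU jobs, which the scheduler then matches opportunistically to whatever pool machine can satisfy each job’s resource requests. The wrapper script name, resource amounts, and job count are illustrative assumptions, not details from the project.

    import htcondor  # HTCondor's Python bindings (requires an HTCondor installation)

    # Describe one GPU job. HTCondor matches it opportunistically to any
    # machine in the pool whose advertised resources satisfy these requests.
    job = htcondor.Submit({
        "executable": "run_bite.sh",          # hypothetical wrapper script
        "arguments": "--shard $(Process)",    # each queued job works on its own piece
        "request_gpus": "1",
        "request_cpus": "4",
        "request_memory": "16GB",
        "should_transfer_files": "YES",       # ship inputs/outputs to the remote site
        "output": "bite_$(Process).out",
        "error": "bite_$(Process).err",
        "log": "bites.log",
    })

    schedd = htcondor.Schedd()              # connect to the local scheduler
    result = schedd.submit(job, count=100)  # queue 100 independent "bites"
    print(f"submitted cluster {result.cluster()}")

Because each queued job stands alone, the scheduler is free to run it on whichever participating site has an idle GPU at that moment; that independence is what lets small pockets of time across many sites add up to one large training run.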

This innovative approach is dismantling the barriers to entry for machine learning research, turning the nation’s collective computing power into a shared engine for scientific inquiry and ensuring the next great breakthrough can come from any researcher, anywhere.

Read more on the NAIRR Pilot website: https://nairrpilot.org/projects/highlights/breaking-gpu-bottleneck