AI Development Navigates The Latency Sensitivity Spectrum: Training Allows For Slow Processing, But Real-Time Tasks Require Lightning-Fast Inference

May 20, 2024


Latency sensitivity in AI workloads varies significantly between training and inference. Training, which processes large datasets over extended periods, is generally very tolerant of high latency, so training jobs can run with minimal concern for immediate responsiveness.

Wes Cummins, the CEO of Applied Digital, joins David Liggitt, the Founder and CEO of datacenterHawk, to talk about the spectrum of latency sensitivity within AI inference tasks. Mission-critical inference applications require ultra-low latency and high reliability, and often need to operate in cloud regions with five-nines (99.999%) reliability. Conversely, batch inference tasks, such as generative AI text-to-image or text-to-video conversion, can afford much higher latency. Chatbots and similar applications fall somewhere in between, with a reasonable tolerance for latency variations.
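
To make the "five-nines" figure concrete, the short sketch below converts an availability target into a yearly downtime budget. It is an illustration only, not something taken from the episode: the function name and the average-year length are assumptions chosen for the example. Under those assumptions, five nines works out to roughly 5.26 minutes of permitted downtime per year.

```python
# Sketch: convert an availability target ("N nines") into a yearly downtime budget.
# Assumption: an average year of 365.25 days (leap days included).
MINUTES_PER_YEAR = 365.25 * 24 * 60


def allowed_downtime_minutes(nines: int) -> float:
    """Return the permitted downtime in minutes per year for N nines of availability."""
    unavailability = 10 ** (-nines)  # e.g. five nines -> 0.00001 (0.001%)
    return MINUTES_PER_YEAR * unavailability


if __name__ == "__main__":
    # Compare three, four, and five nines side by side.
    for n in (3, 4, 5):
        print(f"{n} nines: {allowed_downtime_minutes(n):.2f} minutes/year")
```

Each additional nine cuts the downtime budget by a factor of ten, which is why mission-critical inference sits at the demanding end of the spectrum while batch workloads can accept far looser targets.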
