AI data centers now consume more electricity than entire nations. Training a single large language model can use 1,000 MWh. This article examines the trends driving this growth and the solutions that can curb it.

AI data centers are on a trajectory to become one of the largest consumers of electricity on the planet. The Electric Power Research Institute (EPRI) projects that data centers could consume up to 4% of total U.S. electricity generation by 2030, up from approximately 1.5% in 2022, driven overwhelmingly by AI workloads. The International Energy Agency (IEA) estimates that global data center electricity consumption doubled between 2015 and 2024 and will double again by 2026, with AI as the primary accelerant. This is not incremental growth — it is a structural shift in the composition of electrical demand that has utilities, regulators, and grid operators scrambling to adapt.
The numbers at the workload level are staggering. Training a single large language model on the scale of GPT-4 is estimated to have consumed over 1,000 megawatt-hours (MWh) of electricity — roughly equivalent to the annual consumption of 100 average U.S. households. Next-generation frontier models, trained on larger datasets with more parameters and longer training runs, are projected to require 10,000 MWh or more. These are not theoretical projections; they are the natural consequence of scaling laws that have governed AI development since 2020, where each order-of-magnitude increase in compute budget yields predictable improvements in model capability.
While training has received the most attention for its dramatic per-run energy costs, inference — the process of generating outputs from a trained model — is rapidly becoming the dominant energy consumer in AI data centers. The reason is simple: training happens once (or infrequently for fine-tuning), but inference happens every time a user queries a model. With hundreds of millions of daily queries across services like ChatGPT, Gemini, and Claude, the cumulative energy of inference now exceeds that of training for most deployed models. A 2024 study by the University of Washington estimated that inference accounts for 60-80% of total AI compute energy consumption at major cloud providers, and this share will grow as AI services scale to billions of daily interactions.
The energy cost per inference query varies by model size and architecture, but even small models consume meaningful amounts of power at scale. A single inference request to a 70-billion-parameter model consumes approximately 0.1-0.5 watt-hours — trivial in isolation, but at 100 million queries per day, this translates to 10,000-50,000 MWh per year for a single service. Aggregated across the growing ecosystem of AI-powered applications, from search to coding assistants to autonomous agents, inference energy demand is compounding at 40-60% annually.
The power consumption of individual GPUs has increased dramatically with each generation. The NVIDIA H100, which became the workhorse of AI training in 2024, draws approximately 700 watts per chip under load. The NVIDIA B200, announced for 2025, pushes thermal design power (TDP) to approximately 1,000 watts. Next-generation chips expected in 2026-2027 are projected to exceed 1,500 watts, driven by the demand for larger memory capacities, higher memory bandwidth, and faster interconnect speeds required by frontier-scale models.
At the rack level, the power density implications are profound. A standard 42U rack populated with H100 GPUs draws approximately 40-60 kW — well beyond the 5-10 kW per rack that most legacy data centers were designed to support. B200-based racks push this to 80-120 kW, and next-generation racks will exceed 150 kW. This power density creates cascading challenges: electrical distribution systems must deliver more power per square foot, cooling systems must remove more heat per rack, and building infrastructure must accommodate the resulting thermal loads. Many existing data centers simply cannot be retrofitted to support these densities, requiring entirely new construction.
Power Usage Effectiveness (PUE) remains the primary metric for data center energy efficiency, calculated as total facility power divided by IT equipment power. A PUE of 1.0 would mean every watt of electricity goes to computing — a theoretical ideal. In practice, the industry average PUE stands at approximately 1.58 according to the Uptime Institute's 2024 survey, meaning that for every watt delivered to servers, an additional 0.58 watts powers cooling, lighting, and facility systems. Best-in-class facilities achieve PUE values of 1.05-1.10 through advanced cooling designs, optimized airflow management, and favorable climates that reduce cooling loads. Harch Intelligence targets a PUE below 1.10 at its Dakhla 500MW data center, leveraging the region's naturally cool coastal climate and direct liquid cooling systems to minimize cooling overhead.
The gap between average and best-in-class PUE represents enormous wasted energy. A 100MW data center operating at PUE 1.58 spends 36.7MW on non-computing loads. The same facility at PUE 1.10 spends just 9.1MW on overhead — a savings of 27.6MW, equivalent to the continuous power consumption of approximately 25,000 homes. At Harch Intelligence's scale of 1,798 GPUs across five hubs, each 0.1 improvement in PUE translates to measurable reductions in both energy cost and carbon intensity.
Cooling systems account for 30-40% of total data center energy consumption at facilities using traditional air-cooling methods. This includes computer room air conditioning (CRAC) units, chillers, pumps, and fans that maintain operating temperatures for IT equipment. As GPU power densities increase beyond 60 kW per rack, air cooling approaches its physical limits — the volume of air required to remove that much heat exceeds what can be practically delivered through raised-floor or overhead distribution systems.
Liquid cooling technologies offer a step-change improvement in cooling efficiency. Direct-to-chip liquid cooling circulates coolant through cold plates mounted directly on GPU processors, achieving heat transfer coefficients 1,000-3,000 times higher than air. Immersion cooling submerges entire servers in dielectric fluid, eliminating the need for fans and reducing cooling energy by 40-60% compared to air cooling. Free cooling, which uses ambient outdoor air or water to cool data center systems without mechanical refrigeration, can eliminate cooling energy entirely during favorable conditions. The key metric for free cooling is the number of hours per year when ambient temperature falls below the supply air temperature threshold — typically 20-25°C for most data center configurations.
The Dakhla region in southern Morocco offers a uniquely favorable combination for data center cooling. Despite its location on the edge of the Sahara, Dakhla's coastal position on the Atlantic Ocean means average ambient temperatures remain moderate — between 17°C and 24°C year-round — with consistent ocean breezes that enable more free cooling hours than virtually any other location at comparable latitude. The Dakhla 500MW data center, the largest AI compute project in Africa, leverages this climate to achieve significantly more free cooling hours than inland data centers in Europe or the Middle East, directly reducing both PUE and total energy consumption.
Beyond cooling, North Africa's renewable energy resources provide a second structural advantage. Morocco receives among the highest solar irradiance levels in the world — approximately 3,000+ kWh per square meter per year in the southern regions — enabling solar photovoltaic generation at capacity factors of 25-30%, well above the 15-20% typical in Northern Europe. The country's Atlantic coast also features world-class wind resources, with capacity factors exceeding 40% at coastal sites. The combination of abundant renewable generation and favorable cooling conditions means that AI data centers in Morocco can achieve lower total energy costs and lower carbon intensity simultaneously compared to facilities in Europe or North America.
Co-locating data centers with renewable energy generation reduces transmission losses and improves the economics of both investments. When a data center purchases power through the grid, 5-8% of generated electricity is lost in transmission and distribution. Direct connection to on-site or nearby solar and wind installations eliminates these losses and provides price certainty that hedges against volatile wholesale electricity markets. Harch Energy's renewable installations, which serve Harch Intelligence's GPU fleet, follow this model — dedicated solar and wind capacity connected directly to data center infrastructure, with grid connection serving as backup and off-peak balancing.
Carbon-aware scheduling amplifies the benefit of renewable integration. By aligning compute workloads with periods of peak renewable generation — midday for solar, overnight for wind — carbon-aware systems like HarchOS maximize the fraction of compute powered by renewable energy without requiring expensive battery storage. This is particularly effective for AI training workloads, which can be scheduled flexibly over periods of days or weeks, and for batch inference workloads that can be processed during renewable generation peaks and served from cache during low-generation periods.
AI energy demand is growing at 25-40% annually, driven by three factors: the scaling of existing models (larger training runs), the deployment of new models (more companies training proprietary models), and the exponential growth in inference volume (more users, more applications, more queries). The IEA projects that global data center electricity consumption will reach 800-1,000 TWh by 2026, up from approximately 460 TWh in 2022. Within this total, AI-specific workloads are the fastest-growing segment.
This growth trajectory presents both a challenge and an opportunity. The challenge is that grid infrastructure cannot be expanded at the same pace — new transmission lines, substations, and generation capacity take 5-10 years to plan and build. The opportunity is that AI workloads are among the most flexible large-scale electricity consumers, capable of shifting demand across time and geography in ways that traditional industrial loads cannot. Data centers that implement carbon-aware scheduling, locate in regions with abundant renewable energy, and invest in high-efficiency cooling will not only reduce their environmental impact but will also gain a structural cost advantage as electricity prices increasingly reflect carbon intensity.
North Africa is positioned to become a critical hub for AI compute infrastructure, combining three advantages that no other region can match simultaneously. First, world-class renewable energy resources: Morocco's solar and wind capacity is growing at 20-25% annually, with dedicated installations for industrial consumers providing power at costs below $0.03/kWh — among the lowest in the world. Second, geographic proximity to European demand: submarine fiber cables connect Morocco to European internet exchange points with latency under 10 milliseconds, enabling AI inference workloads to serve European users from Moroccan infrastructure with imperceptible delay. Third, favorable trade and regulatory frameworks: Morocco's free trade agreements with the EU, the U.S., and African nations provide tariff-free access to major markets, and the country's data protection laws are harmonized with EU GDPR, simplifying compliance for European customers.
Harch Corp's vertically integrated model — combining Harch Energy's renewable generation, Harch Intelligence's GPU compute, and Harch Technology's connectivity infrastructure — captures these advantages in a single offering. The Dakhla 500MW data center, powered by dedicated renewable capacity and operated with carbon-aware scheduling achieving 47 gCO2/kWh, represents the blueprint for sustainable AI infrastructure at scale. As AI energy demand continues its exponential growth, the regions that can deliver the most compute per dollar and per tonne of CO2 will win the next decade of AI infrastructure investment. North Africa, and Morocco in particular, has the fundamentals to lead.
Related Topics
More Articles