The non-English internet is already
bigger than you think.
English speakers assume the internet speaks their language. The data tells a different story — and it’s changing faster than most people realize.
If you’ve grown up online in North America, Western Europe, or Australia, you’ve probably absorbed a particular assumption: the internet, more or less, operates in English. English-language platforms dominate the tools you use. The content you consume is in English. The discourse you participate in is in English. It’s easy to conclude that this is simply what the internet is.
The data disagrees. As of 2024, English accounts for approximately 52% of website content by some measures — a significant majority, but not an overwhelming one. More importantly, English represents only around 16% of internet users by native language. The gap between the language of the content and the language of the users is not a footnote. It is the defining structural fact of the global internet.
The language shift in numbers
The scale of non-English internet use is easier to understand when you look at individual language populations rather than aggregate percentages.
- Chinese internet users: 1.05 billion — the largest single-language online population on earth. By sheer user count, Chinese is the most-spoken language on the internet. The infrastructure serving these users — WeChat, Weibo, Douyin, Baidu, Taobao — is largely invisible to English-speaking internet users, which makes it easy to underestimate its scale.
- Hindi internet users: rapidly growing — as mobile internet penetrates rural India, Hindi-speaking users are joining the internet in large numbers. India’s internet population is already over 750 million and growing faster than almost any other country. Hindi is not the only language in India, but it is the most common, and the mobile-first internet that Indian users are accessing is increasingly Hindi-first.
- Arabic, Spanish, Portuguese, Russian, Japanese: each has 100 million or more online users. These are not niche communities — they are continental-scale language populations with their own content ecosystems, social networks, and communication infrastructure.
The implication is worth sitting with. When you use the internet, you are part of a minority language group on a network whose users are overwhelmingly non-English speakers. The internet feels English-dominated because the content is English-dominated. But the internet’s user base is not.
Where the content is
The English dominance of content is real, but it is less total than it appears to English speakers, and it is diminishing across every platform.
Wikipedia
The English Wikipedia is the largest edition by article count, but it is one of 58 editions that have 100,000 or more articles. The Japanese, German, French, Spanish, and Russian editions each contain millions of articles. The Cebuano edition — a language spoken by roughly 25 million people in the Philippines — has more articles than the English edition by raw count, owing to automated generation of geography stubs. The point is not that Cebuano Wikipedia is the most comprehensive — it isn’t — but that the assumption that Wikipedia is essentially an English-language resource understates the depth of its multilingual content by a significant margin.
YouTube
The fastest-growing YouTube markets in 2023 and 2024 were India, Indonesia, and Brazil — all non-English. Korean, Spanish, and Hindi creators have some of the platform’s highest engagement rates globally. K-pop content regularly dominates trending lists in countries where Korean is not spoken. Spanish-language music channels have hundreds of millions of subscribers. The platform’s linguistic center of gravity has been shifting for years; what used to be an English-majority content ecosystem is now genuinely multilingual at the top of the engagement rankings.
TikTok
The majority of TikTok content and engagement is non-English. The algorithm surfaces content by engagement pattern rather than by language, which means non-English content regularly reaches English-speaking audiences and vice versa. This makes TikTok the most genuinely multilingual of the major Western-accessible social platforms — not because it was designed to be, but because its engagement-first recommendation engine is language-agnostic in a way that its predecessors were not.
The communication gap
Despite the scale of non-English internet use, communication tools still default to English in ways that create a persistent structural disadvantage for non-English speakers.
Most collaborative platforms — Slack, Teams, Notion, Figma, GitHub — are English-first. Their interfaces may be localized, but their default interaction norms, their community documentation, and their power-user communities are English-speaking. Non-English users who want to access the full capability of these tools often need to read English documentation, participate in English-language forums, and communicate with English-speaking support teams.
Social networks that nominally support multiple languages still have English-dominant recommendation engines. The content that surfaces in global trending feeds is disproportionately English-language content, not because English-language content is more engaging, but because the systems were built, tuned, and iterated primarily against English-speaking user behavior.
Non-English internet users are consuming content in their language but communicating in English — often poorly — or not at all.
This is the communication gap. Non-English users can read content in their language. They can consume media in their language. But when it comes to participating — in discussions, in communities, in real-time conversation with people outside their language group — the tools push them toward English or keep them within their language community. There is no good middle path between “communicate in English at a disadvantage” and “stay within your language group.”
What changes when the tools catch up
Real-time translation is the layer that changes this equation. It is not a translation button on a forum post — it is a structural shift in who can participate in a conversation and under what conditions.
When real-time translation works invisibly, a Brazilian user and a Korean user can have a voice conversation about anything — a shared hobby, a business arrangement, a friendship — without either party needing to communicate in a third language they may not speak well. The conversation happens in both languages simultaneously. The friction that previously kept them in separate communities disappears.
The non-English internet isn’t coming. It’s here. The tools are catching up. And the gap they’re closing is not a minor convenience gap — it is the gap between participation and exclusion for more than three billion people.
The next wave of internet growth is happening in South Asia, Sub-Saharan Africa, and Southeast Asia — regions where English is rarely the primary language. The users coming online in the next decade are overwhelmingly going to be non-English speakers. The platforms and tools that assume English as a default will reach fewer of them. The ones that build for multilingual participation from the ground up will reach everyone.
That is what Babel is designed around. Not a translation feature added to an English-first product. A communication layer built for a multilingual internet from the start — because the internet has always been multilingual. Most of its tools just haven’t been.
Babel is built for every language on the internet.
Speak any language. Reach every community. Start for free.
Get Babel Free →Related reading: Why Real-Time Translation Changes Everything · The Rise of the Non-English Internet · Who uses Babel