The contents of PyPI, in numbers

Total files: 1.05 billion (78,926,653 unique)
Total lines of text: 327.9 billion (327,927,103,998 to be precise)
Total uncompressed size: 56.3 TiB (that is ~42,169,471.024 floppy disks)
Lines of code added per second: 3,067 (in the month starting 2023-08-01)

This page breaks down the contents of PyPI, based on parsing the contents of its packages. You can download PyPI locally to do your own analysis, or run SQL queries on the data in your browser.

Language Features

This data counts unique projects only, not versions. For example, if a project published 10 versions in a month, each containing an async function, it is counted once.
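The counting rule above can be sketched as follows. This is a minimal illustration, assuming the input is a list of hypothetical `(project, version, feature)` records for one month; the record layout and the project names are made up for the example and are not taken from the page's data.

```python
from collections import defaultdict

# Hypothetical records: one (project, version, feature) tuple per release
# in which a feature was seen. The field layout is an assumption.
records = [
    ("example-a", "1.0", "async"),
    ("example-a", "1.1", "async"),
    ("example-a", "1.1", "walrus"),
    ("example-b", "0.3", "async"),
]


def count_unique_projects(rows):
    """Map each feature to the number of distinct projects that use it."""
    projects_by_feature = defaultdict(set)
    for project, _version, feature in rows:
        # A set ignores repeat sightings of the same project across versions.
        projects_by_feature[feature].add(project)
    return {feature: len(projects) for feature, projects in projects_by_feature.items()}


counts = count_unique_projects(records)
```

Here `example-a` ships two versions with an async function but contributes only one to the `async` count.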

Breakdown

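| Name | Projects | Percent |
| --- | --- | --- |
| list comp | 226,072 | 49 |
| fstring | 155,226 | 34 |
| annotations | 131,143 | 29 |
| generator expression | 121,229 | 26 |
| dict comp | 89,232 | 19 |
| async | 29,329 | 6 |
| dataclasses | 26,935 | 6 |
| set comp | 21,746 | 5 |
| walrus | 9,728 | 2 |
| match | 2,676 | 1 |
| async comp | 1,064 | 0 |
| try star | 22 | 0 |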
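The page does not say how these features are detected. One plausible approach, sketched here under that assumption, is to parse each file with Python's standard `ast` module and look for the node types that correspond to each feature. The `FEATURE_NODES` mapping and the `detect_features` helper are hypothetical names, and the sketch covers only some rows of the table.

```python
import ast

# Feature name (as in the table) -> AST node types that signal it.
# This mapping is an assumption for illustration, not the page's actual rules.
FEATURE_NODES = {
    "list comp": (ast.ListComp,),
    "dict comp": (ast.DictComp,),
    "set comp": (ast.SetComp,),
    "generator expression": (ast.GeneratorExp,),
    "fstring": (ast.JoinedStr,),
    "walrus": (ast.NamedExpr,),
    "async": (ast.AsyncFunctionDef, ast.AsyncFor, ast.AsyncWith, ast.Await),
}

# These node types only exist on newer interpreters, so look them up defensively.
for name, attr in (("match", "Match"), ("try star", "TryStar")):
    node_type = getattr(ast, attr, None)
    if node_type is not None:
        FEATURE_NODES[name] = (node_type,)


def detect_features(source):
    """Return the set of feature names found in one Python source string."""
    found = set()
    for node in ast.walk(ast.parse(source)):
        for name, node_types in FEATURE_NODES.items():
            if isinstance(node, node_types):
                found.add(name)
    return found


sample = "squares = [n * n for n in range(5)]\nprint(f'{len(squares)} items')\n"
features = detect_features(sample)
```

`ast.parse` raises `SyntaxError` for files the running interpreter cannot parse (Python 2 code, for example), so a real pipeline would need to decide how to handle those files.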

Project Contents

This data counts unique projects only, not versions. For example, if a project published 10 versions in a month, each with a setup.py file, it is counted once.

Secrets Detected

PyPI contains a lot of secrets.

| Type | Count |
| --- | --- |
| Google API Key | 4,053 |
| OpenAI API Key | 3,568 |
| Tencent Cloud Secret ID | 1,901 |
| Amazon AWS Secret Access Key | 1,640 |
| Amazon AWS Access Key ID | 1,373 |
| Google Cloud Private Key ID | 1,098 |
| Slack API Token | 1,063 |
| Telegram Bot Token | 868 |
| Slack Incoming Webhook URL | 784 |
| SendGrid API Key | 748 |
| Mailgun API Key | 718 |
| Mailchimp API Key | 676 |
| Stripe API Key | 662 |
| Twilio Account String Identifier | 576 |
| Alibaba Cloud AccessKey Secret | 555 |
| Total | 25,826 |
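The page does not describe the scanner behind these counts. As a rough illustration of how pattern-based secret detection generally works, the sketch below matches two commonly published key formats with regular expressions. The patterns and the `scan_text` helper are assumptions for the example; a real scanner is likely to use more rules and stricter validation.

```python
import re

# Illustrative patterns only. Both follow widely published prefixes and lengths,
# but the scanner behind the table may use different or stricter rules.
SECRET_PATTERNS = {
    "Amazon AWS Access Key ID": re.compile(r"\bAKIA[0-9A-Z]{16}\b"),
    "Google API Key": re.compile(r"\bAIza[0-9A-Za-z_\-]{35}(?![0-9A-Za-z_\-])"),
}


def scan_text(text):
    """Return (type, matched_string) pairs for every pattern hit in the text."""
    hits = []
    for secret_type, pattern in SECRET_PATTERNS.items():
        for match in pattern.finditer(text):
            hits.append((secret_type, match.group(0)))
    return hits


# AKIAIOSFODNN7EXAMPLE is the placeholder key used in AWS documentation.
sample = 'aws_access_key_id = "AKIAIOSFODNN7EXAMPLE"\nname = "demo"\n'
hits = scan_text(sample)
```

Pattern matches like these are candidates rather than confirmed live credentials, so counts produced this way can include placeholders and revoked keys.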

Growth

8.92 million

60TB

1 billion

PyPI is growing fast. If this dangerous expansion is not stopped, our advanced machine learning models predict that in only 8 years the number of packages will outnumber human beings.


Binary files

This shows a breakdown of the binary files on PyPI, by extension. Binary files are the vast majority of the content on PyPI, accounting for nearly 75% of the uncompressed size.

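| Extension | Total files | Total size | Unique files |
| --- | --- | --- | --- |
| .so | 6,415,869 | 19.7 TiB | 3,574,770 |
| .pyd | 1,713,498 | 4.0 TiB | 1,573,713 |
| .dylib | 1,058,397 | 2.7 TiB | 348,995 |
| .dll | 1,266,881 | 1.9 TiB | 372,103 |
| No extension | 4,754,719 | 1.9 TiB | 1,663,203 |
| .2 | 149,114 | 1.4 TiB | 17,297 |
| .0 | 586,225 | 1.2 TiB | 65,414 |
| .jar | 412,313 | 837.4 GiB | 42,548 |
| .png | 24,858,234 | 491.2 GiB | 765,834 |
| .1 | 284,505 | 447.2 GiB | 39,041 |
| .lib | 112,458 | 430.6 GiB | 32,836 |
| .exe | 194,819 | 390.8 GiB | 44,652 |
| .gz | 4,168,631 | 380.0 GiB | 548,053 |
| .tgz | 335,368 | 357.9 GiB | 154,603 |
| .7 | 36,771 | 304.7 GiB | 2,497 |
| Total | 46,347,802 | 36.4 TiB | 9,245,559 |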

Largest Projects by size

TensorFlow dominates this list with 8.9 TiB of uncompressed data, 16% of all data on PyPI.

| Project name | Unique files | Total files | Total lines | Total size |
| --- | --- | --- | --- | --- |
| tf-nightly | 85,750 | 20,242,423 | 8,325,796,068 | 2.3 TiB |
| tf-nightly-cpu | 80,842 | 19,960,824 | 8,080,533,316 | 1.8 TiB |
| tf-nightly-gpu | 70,457 | 11,657,756 | 4,814,002,048 | 1.4 TiB |
| lalsuite | 1,719,153 | 9,969,505 | 4,542,135,454 | 1.1 TiB |
| tensorflow | 98,007 | 7,661,737 | 2,882,658,484 | 864.7 GiB |
| paddlepaddle-gpu | 30,859 | 2,000,670 | 433,161,447 | 856.7 GiB |
| tensorflow-io-nightly | 14,020 | 927,623 | 116,109,192 | 742.5 GiB |
| tf-nightly-cpu-aws | 49,655 | 8,372,198 | 3,236,007,762 | 683.7 GiB |
| tensorflow-gpu | 83,932 | 4,276,504 | 1,575,491,698 | 638.0 GiB |
| catboost-dev | 32,526 | 256,620 | 66,348,683 | 582.2 GiB |
| tf-nightly-intel | 77,131 | 7,263,510 | 2,958,885,573 | 488.4 GiB |
| tensorflow-cpu | 55,842 | 4,899,681 | 1,903,928,312 | 476.2 GiB |
| tf-nightly-macos | 25,662 | 4,949,069 | 2,355,608,468 | 463.8 GiB |
| OpenVisus | 60,484 | 3,708,820 | 730,470,294 | 437.1 GiB |
| graphscope-client | 36,304 | 1,294,627 | 149,056,559 | 425.4 GiB |
| Total | 2,520,624 | 107,441,567 | 42,170,193,358 | 13.1 TiB |

Stats By Extensions

This only considers the last suffix of the file path as the extension.

| Extension | Total files | Total lines | Total size | Unique files |
| --- | --- | --- | --- | --- |
| .py | 456,168,853 | 119,469,907,893 | 4.4 TiB | 30,833,340 |
| .h | 80,898,960 | 24,382,766,597 | 963.8 GiB | 693,320 |
| No extension | 57,772,234 | 7,667,850,460 | 2.3 TiB | 15,549,473 |
| .json | 52,329,206 | 21,161,457,055 | 1.0 TiB | 1,574,195 |
| .hpp | 38,587,034 | 7,946,850,198 | 320.6 GiB | 278,247 |
| .txt | 37,176,488 | 17,224,921,091 | 635.3 GiB | 3,289,949 |
| .js | 30,938,557 | 12,515,763,947 | 1.0 TiB | 1,299,028 |
| .png | 24,893,739 | 899,161 | 491.3 GiB | 768,446 |
| .rst | 21,516,495 | 1,350,186,975 | 51.8 GiB | 1,229,223 |
| .svg | 16,958,243 | 1,361,240,312 | 171.2 GiB | 314,635 |
| .pyi | 16,357,154 | 3,166,271,355 | 104.7 GiB | 456,382 |
| .html | 15,165,631 | 2,830,097,499 | 201.3 GiB | 1,586,120 |
| .yaml | 11,662,147 | 1,102,201,412 | 39.6 GiB | 326,808 |
| .pyc | 10,661,980 | 276,458 | 67.2 GiB | 4,970,135 |
| .yml | 9,967,343 | 717,085,606 | 21.3 GiB | 297,940 |
| Total | 881,054,064 | 220,897,776,019 | 11.7 TiB | 63,467,241 |
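The "last suffix" rule stated for this table can be illustrated with Python's standard `pathlib`. This is a sketch of the rule as described, not necessarily the code used to build the page; the `last_suffix` helper and the example paths are hypothetical.

```python
from pathlib import PurePosixPath


def last_suffix(path):
    """Return the final suffix of a path, or 'No extension' if there is none."""
    suffix = PurePosixPath(path).suffix
    return suffix if suffix else "No extension"


# Hypothetical file paths, chosen to show how multi-part names are bucketed.
examples = ["pkg/module.py", "dist/archive.tar.gz", "lib/libfoo.so.2", "LICENSE"]
extensions = [last_suffix(p) for p in examples]
```

Under this rule `archive.tar.gz` is counted as `.gz` rather than `.tar.gz`, and a versioned shared library name such as `libfoo.so.2` is counted under `.2`, which would be consistent with the numeric extensions in the binary files table.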