Alexandria Logo

Religion
Case Law
Research
Patents
Downloads
About Us

Downloads

Timestamp
Description
Documents
Size
Link

2023-05-04

All papers on Arxiv.org embedded by title using the InstructorXL model.

2.3 M

6.5 GB

2023-05-04

All papers on Arxiv.org embedded by abstract using the InstructorXL model.

2.3 M

7.6 GB

2023-06-14

All major religious texts embedded using the Ada-002 model.

50 M

20 GB

↓   Help us decide what to embed next by voting below!   ↓

??????

All US cases from the Case Law Project using the InstructorXL model.

36.3 M

~80 GB

??????

All patents on USPTO embedded using the InstructorXL model.

18.2 M

~61 GB

??????

All of English Wikipedia embedded using the InstructorXL model.

6.6 M

~22 GB

??????

All repositories on Github using a to-be-determined model.

3.1 B

~3.4 TB

Support

Help support our initiative by building with our datasets, voting on what you want next, spreading the word about the project, or even donating to help us in our mission to embed the internet. If you’re interested in our work or want to contribute, get in touch!

PS. Are you a vector database company? Want the future to be built on your platform? We're looking for partners to build Alexandria.