Skip to content

nulib-labs/Impulse

Repository files navigation

Now 100% sugar free

New features

Impulse manifests: You can now build out meta-manifests for Impulse! See ./tests/impulse_manifest/manifest.yaml for details.

You can use that manifest with uv run lpad add ./tests/impulse_manifest/manifest.yaml

Impulse Logo

Impulse is an easy-to-deploy all-in-one solution for extremely high-throuput embarrassingly parallel data extraction built for digitization, indexing, and distribution through IIIF manifests.

Running on Northwestern University's Quest supercomputing cluster, we observed hundreds of pages per hour. (This is while limiting ourselves to 50 concurrent jobs!) This could easily hit thousands of pages per hour.

Flex

This code is very easily extensible to new models/paradigms. Just edit the impulse.tasks.DocumentExtraction class!

This project would not be possible without the work of other amazing open- source projects.

ImpulseSuperCompute

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Packages

 
 
 

Contributors

Languages