Skip to content

Latest commit

 

History

History
11 lines (9 loc) · 773 Bytes

File metadata and controls

11 lines (9 loc) · 773 Bytes

crossmodal

paper

> script/setup # Clone AudioSet models and download pre-trained weights
> python -m crossmodal.embedding +embedding.input_dir='.../audio/' +embedding.algo='openl3' +embedding.output_dir='.../embeddings/'
> python -m crossmodal.embedding +embedding.input_dir='.../video/' +embedding.algo='resnet' +embedding.output_dir='.../embeddings/'
> python -m crossmodal.train +train.embedding_dir='.../embeddings/' +train.model_dir='.../models/' +train.audio_alg='openl3' +train.image_alg='resnet'
> python -m crossmodal.translate +translate.embedding_dir='.../embeddings/' +translate.output_dir='.../translated_embeddings/' +train.model_file='.../models/model.ckpt' +train.audio_alg='openl3' +train.image_alg='resnet'