$ fad_embed -husage: fad_embed [-h] [--batch_size BATCH_SIZE] [--sample_size SAMPLE_SIZE] [--chunk_size CHUNK_SIZE] [--hop_size HOP_SIZE] [--max_hops MAX_HOPS] [--sr SR] [--verbose][--debug]embed_model real_path fake_pathpositional arguments:embed_model choice of embedding model(s): clap |vggish|pann|openl3|allreal_path Path of files of real audiofake_path Path of files of fake audiooptions:-h,--help show this help message and exit--batch_size BATCH_SIZEMAXIMUM Batch size for computing embeddings (may go smaller)(default: 64)--sample_size SAMPLE_SIZENumber of audio samples to read from each audio file (default: 262144)--chunk_size CHUNK_SIZELength of chunks (inaudio samples)to embed (default: 24000)--hop_size HOP_SIZE (approximate)time difference (inseconds)between each chunk (default: 0.1)--max_hops MAX_HOPS Don't exceed this many hops/chunks/embeddings per audio file. <= 0 disables this. (default: -1) --sr SR sample rate (will resample inputs at this rate) (default: 48000) --verbose Show notices of resampling when reading files (default: False) --debug Extra messages for debugging this program (default: False)
First a couple utilities for downloading checkpoints:
/fsx/shawley/envs_sm/aa/lib/python3.10/site-packages/librosa/util/decorators.py:88: UserWarning: Empty filters detected in mel frequency basis. Some channels will produce empty responses. Try increasing your sampling rate (and fmax) or reducing n_mels.
return f(*args, **kwargs)
Downloading: "https://github.com/torchopenl3/torchopenl3-models/raw/master/torchopenl3_mel256_music_512.pth.tar" to /home/shawley/.cache/torch/hub/checkpoints/torchopenl3_mel256_music_512.pth.tar
100%|██████████| 34.9M/34.9M [00:00<00:00, 249MB/s]