    Add MusicGen Melody (#28819) · c43b380e
    Yoach Lacombe authored
    
    
    * first modeling code
    
    * make repository
    
    * still WIP
    
    * update model
    
    * add tests
    
    * add latest change
    
    * clean docstrings and copied from
    
    * update docstrings md and readme
    
    * correct chroma function
    
    * correct copied from and remove unrelated test
    
    * add doc to toctree
    
    * correct imports
    
    * add convert script to notdoctested
    
    * Add suggestion from Sanchit
    
    Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
    
    * correct get_unconditional_inputs docstrings
    
    * modify README according to Sanchit's feedback
    
    * add chroma to audio utils
    
    * clean librosa and torchaudio hard dependencies
    
    * fix FE
    
    * refactor audio decoder -> audio encoder for consistency with previous musicgen
    
    * refactor conditional -> encoder
    
    * modify sampling rate logic
    
    * modify license at the beginning
    
    * refactor all_self_attns->all_attentions
    
    * remove ignore copy from causallm generate
    
    * add copied from for from_sub_models
    
    * fix make copies
    
    * add warning if audio is truncated
    
    * add copied from where relevant
    
    * remove artefact
    
    * fix convert script
    
    * fix torchaudio and FE
    
    * modify chroma method according to feedback -> better naming
    
    * refactor input_values->input_features
    
    * refactor input_values->input_features and fix import fe
    
    * add input_features to docstrings
    
    * correct inputs_embeds logic
    
    * remove dtype conversion
    
    * refactor _prepare_conditional_hidden_states_kwargs_for_generation -> _prepare_encoder_hidden_states_kwargs_for_generation
    
    * change warning for chroma length
    
    * Update src/transformers/models/musicgen_melody/convert_musicgen_melody_transformers.py
    
    Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
    
    * change way to save wav, using soundfile
    
    * correct docs and change to soundfile
    
    * fix import
    
    * fix init proj layers
    
    * remove line breaks from md
    
    * fix issue with docstrings
    
    * add FE suggestions
    
    * improve is in logics and remove useless imports
    
    * remove custom from_pretrained
    
    * simplify docstring code
    
    * add suggestions for modeling tests
    
    * make style
    
    * update converting script with sanity check
    
    * remove encoder attention mask from conditional generation
    
    * replace musicgen melody checkpoints with official orga
    
    * rename ylacombe->facebook in checkpoints
    
    * fix copies
    
    * remove unnecessary warning
    
    * add shape in code docstrings
    
    * add files to slow doc tests
    
    * fix md bug and add md to not_tested
    
    * make fix-copies
    
    * fix hidden states test and batching
    
    ---------
    
    Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>