• Add `Kosmos-2` model (#24709) · 691fd8fd
    Yih-Dar authored
    
    
    * Add KOSMOS-2 model
    
    * update
    
    * update
    
    * update
    
    * address review comment - 001
    
    * address review comment - 002
    
    * address review comment - 003
    
    * style
    
    * Apply suggestions from code review
    
    Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
    
    * fix
    
    * address review comment - 004
    
    * address review comment - 005
    
    * address review comment - 006
    
    * address review comment - 007
    
    * address review comment - 008
    
    * address review comment - 009
    
    * address review comment - 010
    
    * address review comment - 011
    
    * update readme
    
    * fix
    
    * fix
    
    * fix
    
    * [skip ci] fix
    
    * revert the change in _decode
    
    * fix docstring
    
    * fix docstring
    
    * Update docs/source/en/model_doc/kosmos-2.md
    
    Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
    
    * no more Kosmos2Tokenizer
    
    * style
    
    * remove "returned when being computed by the model"
    
    * Apply suggestions from code review
    
    Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
    
    * UTM5 Atten
    
    * fix attn mask
    
    * use present_key_value_states instead of next_decoder_cache
    
    * style
    
    * conversion scripts
    
    * conversion scripts
    
    * conversion scripts
    
    * Add _reorder_cache
    
    * fix doctest and copies
    
    * rename 1
    
    * rename 2
    
    * rename 3
    
    * make fixup
    
    * fix table
    
    * fix docstring
    
    * rename 4
    
    * change repo_id
    
    * remove tip
    
    * update md file
    
    * make style
    
    * update md file
    
    * put docs/source/en/model_doc/kosmos-2.md to slow
    
    * update conversion script
    
    * Use CLIPImageProcessor in Kosmos2Processor
    
    * Remove Kosmos2ImageProcessor
    
    * Remove to_dict in Kosmos2Config
    
    * Remove files
    
    * fix import
    
    * Update conversion
    
    * normalized=False
    
    * Not using hardcoded values like <image>
    
    * elt --> element
    
    * Apply suggestion
    
    * Not using hardcoded values like </image>
    
    * No assert
    
    * No nested functions
    
    * Fix md file
    
    * copy
    
    * update doc
    
    * fix docstring
    
    * fix name
    
    * Remove _add_remove_spaces_around_tag_tokens
    
    * Remove dummy docstring of _preprocess_single_example
    
    * Use `BatchEncoding`
    
    * temp
    
    * temp
    
    * temp
    
    * Update
    
    * Update
    
    * Make Kosmos2ProcessorTest a bit prettier
    
    * Update gradient checkpointing
    
    * Fix gradient checkpointing test
    
    * Remove one liner remove_special_fields
    
    * Simplify conversion script
    
    * fix add_eos_token
    
    * update readme
    
    * update tests
    
    * Change to microsoft/kosmos-2-patch14-224
    
    * style
    
    * Fix doc
    
    ---------
    
    Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
    Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
    Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
    Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>