Unverified Commit a570e2ba authored by Mayank Mishra's avatar Mayank Mishra Committed by GitHub
Browse files

add shared experts for upcoming Granite 4.0 language models (#35894)


* Modular GraniteMoE with shared Experts.

Signed-off-by: default avatarShawn Tan <shawntan@ibm.com>

* Modified

* Import order.

* Modified for style

* Fix space.

* Test

* Remove extra granitemoe file.

* New converted file and tests

* Modified __init__ files.

* Formatting.

* Dummy PT objects

* register granitemoe shared model

Signed-off-by: default avatarSukriti-Sharma4 <sukriti.sharma4@ibm.com>

* fix linting of a file

Signed-off-by: default avatarSukriti-Sharma4 <sukriti.sharma4@ibm.com>

* fix import in modeling file

Signed-off-by: default avatarSukriti-Sharma4 <sukriti.sharma4@ibm.com>

* update generated modeling file

Signed-off-by: default avatarSukriti-Sharma4 <sukriti.sharma4@ibm.com>

* add documentation

Signed-off-by: default avatarSukriti-Sharma4 <sukriti.sharma4@ibm.com>

* update docstrings

Signed-off-by: default avatarSukriti-Sharma4 <sukriti.sharma4@ibm.com>

* update generated modeling file

Signed-off-by: default avatarSukriti-Sharma4 <sukriti.sharma4@ibm.com>

* fix docstrings in config class

Signed-off-by: default avatarSukriti-Sharma4 <sukriti.sharma4@ibm.com>

* merge main

Signed-off-by: default avatarSukriti-Sharma4 <sukriti.sharma4@ibm.com>

---------

Signed-off-by: default avatarShawn Tan <shawntan@ibm.com>
Signed-off-by: default avatarSukriti-Sharma4 <sukriti.sharma4@ibm.com>
Co-authored-by: default avatarShawn Tan <shawntan@ibm.com>
Co-authored-by: default avatarShawn Tan <shawn@wtf.sg>
Co-authored-by: default avatarSukriti-Sharma4 <sukriti.sharma4@ibm.com>
Co-authored-by: default avatarSukriti Sharma <Ssukriti@users.noreply.github.com>
parent 7ae7e87a
No related merge requests found
Showing with 2517 additions and 0 deletions
+2517 -0
Supports Markdown
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment