[`Llama ROPE`] Fix torch export but also slow downs in forward (#29198) (8a8a0a4a) · Commits · zhusg / transformers-new

Unverified Commit 8a8a0a4a authored 1 year ago by

Arthur Committed by GitHub 1 year ago

[`Llama ROPE`] Fix torch export but also slow downs in forward (#29198)

* remove control flow

* update gptneox

* update ....

* nits

* Actually let's just break. Otherwise we are silently failing which imo is not optimal

* version BC

* fix tests

* fix eager causal

* nit

* add a test

* style

* nits

* nits

* more nits for the test

* update and fix

* make sure cuda graphs are not skipped

* read token is needed for meta llama

* update!

* fiixup

* compile test should be slow

* fix thet fix copies

* stle 🫠

parent 7c87f357

No related merge requests found

Hide whitespace changes

Inline Side-by-side

Showing with 75 additions and 23 deletions

+75 -23

Please register or to comment