transformers code fixes

#2
by pszemraj - opened

hi! coming from a previous issue in the v1.1, I noticed that there were still issues using the model in transformers itself, so I went and implemented some fixes and you can see the resulting "fork" repo here. it works in transformers, as you can see in this colab here

some notes:

  • if you find this useful feel free to take whatever parts of the fork you like! additionally, happy to PR some/all of it, I did not lead with that as there are several changes (including processor cleanup to make it easier to import) so diff is large
  • I confirm the vllm path works with no custom code needed

Thanks for sharing this @pszemraj . We have made changes recently in #4 and #6 to extend support for the latest versions of transformers and vLLM. So going to close this now.

nvidia-oliver-holworthy changed discussion status to closed

Sign up or log in to comment