Use default attention implementation with option to override

#2

Enables specifying attn_implementation when loading model including spda

Thank you!

Publish this branch
This branch is in draft mode, publish it to be able to merge.

Sign up or log in to comment