aboutsummaryrefslogtreecommitdiffstats
path: root/modules/sub_quadratic_attention.py
AgeCommit message (Collapse)AuthorLines
2023-05-11Autofix Ruff W (not W605) (mostly whitespace)Aarni Koskela-1/+1
2023-05-11repair #10266AUTOMATIC-13/+5
2023-05-10Update sub_quadratic_attention.pyLouis Del Valle-6/+15
1. Determine the number of query chunks. 2. Calculate the final shape of the res tensor. 3. Initialize the tensor with the calculated shape and dtype, (same dtype as the input tensors, usually) Can initialize the tensor as a zero-filled tensor with the correct shape and dtype, then compute the attention scores for each query chunk and fill the corresponding slice of tensor.
2023-01-25Add UI setting for upcasting attention to float32brkirch-2/+2
Adds "Upcast cross attention layer to float32" option in Stable Diffusion settings. This allows for generating images using SD 2.1 models without --no-half or xFormers. In order to make upcasting cross attention layer optimizations possible it is necessary to indent several sections of code in sd_hijack_optimizations.py so that a context manager can be used to disable autocast. Also, even though Stable Diffusion (and Diffusers) only upcast q and k, unfortunately my findings were that most of the cross attention layer optimizations could not function unless v is upcast also.
2023-01-09Remove fallback for Protocol import and remove Protocol import and remove ↵AUTOMATIC-8/+11
instances of Protocol in code add some whitespace between functions to be in line with other code in the repo
2023-01-07Add fallback for Protocol importProGamerGov-1/+7
2023-01-06Added licensebrkirch-1/+1
2023-01-06Use narrow instead of dynamic_slicebrkirch-15/+19
2023-01-06Add Birch-san's sub-quadratic attention implementationbrkirch-0/+201