path: root/modules/devices.py
Commit message [Author, Age, Files, Lines]
* Update [wangshuai09, 2024-01-31, 1, -0/+7]
|
* Merge branch 'dev' into npu_support [wangshuai09, 2024-01-30, 1, -3/+95]
|\
| * Revert "Try to reverse the dtype checking mechanism" [Kohaku-Blueleaf, 2024-01-29, 1, -2/+5]
| |     This reverts commit d243e24f539d717b221992e894a5db5a321bf3cd.
| * Try to reverse the dtype checking mechanism [Kohaku-Blueleaf, 2024-01-29, 1, -5/+2]
| |
| * linting [Kohaku-Blueleaf, 2024-01-29, 1, -1/+0]
| |
| * Fix potential bugs [Kohaku-Blueleaf, 2024-01-29, 1, -2/+7]
| |
| * Avoid exceptions to be silenced [Kohaku-Blueleaf, 2024-01-20, 1, -6/+5]
| |
| * Avoid early disable [Kohaku-Blueleaf, 2024-01-20, 1, -0/+4]
| |
| * Fix nested manual cast [Kohaku-Blueleaf, 2024-01-18, 1, -1/+5]
| |
| * rearrange if-statements for cpu [Kohaku-Blueleaf, 2024-01-09, 1, -3/+3]
| |
| * Apply the correct behavior of precision='full' [Kohaku-Blueleaf, 2024-01-09, 1, -4/+7]
| |
| * Revert "Apply correct inference precision implementation" [Kohaku-Blueleaf, 2024-01-09, 1, -33/+9]
| |     This reverts commit e00365962b17550a42235d1fbe2ad2c7cc4b8961.
| * Apply correct inference precision implementation [Kohaku-Blueleaf, 2024-01-09, 1, -9/+33]
| |
| * linting and debugs [Kohaku-Blueleaf, 2024-01-09, 1, -6/+6]
| |
| * Fix bugs when arg dtype doesn't match [KohakuBlueleaf, 2024-01-09, 1, -15/+10]
| |
| * improve efficiency and support more device [Kohaku-Blueleaf, 2024-01-09, 1, -17/+43]
| |
| * change import statements for #14478 [AUTOMATIC1111, 2023-12-31, 1, -2/+2]
| |
| * Add utility to inspect a model's parameters (to get dtype/device) [Aarni Koskela, 2023-12-31, 1, -1/+2]
| |
| * Merge branch 'dev' into test-fp8 [Kohaku-Blueleaf, 2023-12-03, 1, -0/+13]
| |\
| * \ Merge branch 'dev' into test-fp8 [Kohaku-Blueleaf, 2023-12-02, 1, -1/+1]
| |\ \
| * | | Better naming [Kohaku-Blueleaf, 2023-11-19, 1, -3/+3]
| | | |
| * | | Use options instead of cmd_args [Kohaku-Blueleaf, 2023-11-19, 1, -11/+14]
| | | |
| * | | Add MPS manual cast [KohakuBlueleaf, 2023-10-28, 1, -1/+5]
| | | |
| * | | ManualCast for 10/16 series gpu [Kohaku-Blueleaf, 2023-10-28, 1, -6/+51]
| | | |
| * | | Add CPU fp8 support [Kohaku-Blueleaf, 2023-10-23, 1, -1/+5]
| | | |     Since norm layer need fp32, I only convert the linear operation layer(conv2d/linear)
| | | |     And TE have some pytorch function not support bf16 amp in CPU. I add a condition to indicate if the autocast is for unet.
* | | | Add NPU Support [wangshuai09, 2024-01-29, 1, -2/+7]
| |_|/
|/| |
* | | Merge pull request #14171 from Nuullll/ipex [AUTOMATIC1111, 2023-12-02, 1, -0/+13]
|\ \ \
| |_|/
|/| |     Initial IPEX support for Intel Arc GPU
| * | Disable ipex autocast due to its bad perf [Nuullll, 2023-12-02, 1, -7/+13]
| | |
| * | Initial IPEX support [Nuullll, 2023-11-30, 1, -2/+9]
| |/
* | Merge pull request #14131 from read-0nly/patch-1 [AUTOMATIC1111, 2023-12-02, 1, -1/+1]
|\ \
| |/
|/|     Update devices.py - Make 'use-cpu all' actually apply to 'all'
| * Update devices.py [obsol, 2023-11-28, 1, -1/+1]
| |     fixes issue where "--use-cpu" all properly makes SD run on CPU but leaves ControlNet (and other extensions, I presume) pointed at GPU, causing a crash in ControlNet caused by a mismatch between devices between SD and CN
| |     https://github.com/AUTOMATIC1111/stable-diffusion-webui/issues/14097
* | fix for crash when running #12924 without --device-id [AUTOMATIC1111, 2023-09-09, 1, -1/+1]
| |
* | More accurate check for enabling cuDNN benchmark on 16XX cards [catboxanon, 2023-08-31, 1, -1/+2]
|/
* split shared.py into multiple files; should resolve all circular reference import errors related to shared.py [AUTOMATIC1111, 2023-08-09, 1, -9/+1]
|
* rework RNG to use generators instead of generating noises beforehand [AUTOMATIC1111, 2023-08-09, 1, -79/+2]
|
* rework torchsde._brownian.brownian_interval replacement to use device.randn_local and respect the NV setting. [AUTOMATIC1111, 2023-08-03, 1, -6/+38]
|
* add NV option for Random number generator source setting, which allows to generate same pictures on CPU/AMD/Mac as on NVidia videocards. [AUTOMATIC1111, 2023-08-02, 1, -2/+37]
|
* Fix MPS cache cleanup [Aarni Koskela, 2023-07-11, 1, -2/+3]
|     Importing torch does not import torch.mps so the call failed.
* added torch.mps.empty_cache() to torch_gc() [AUTOMATIC1111, 2023-07-08, 1, -0/+3]
|     changed a bunch of places that use torch.cuda.empty_cache() to use torch_gc() instead
* Remove a bunch of unused/vestigial code [Aarni Koskela, 2023-06-05, 1, -7/+0]
|     As found by Vulture and some eyes
* run basic torch calculation at startup in parallel to reduce the performance impact of first generation [AUTOMATIC, 2023-05-21, 1, -0/+18]
|
* ruff auto fixes [AUTOMATIC, 2023-05-10, 1, -1/+1]
|
* rename CPU RNG to RNG source in settings, add infotext and parameters copypaste support to RNG source [AUTOMATIC, 2023-04-29, 1, -2/+2]
|
* Option to use CPU for random number generation. [Deciare, 2023-04-19, 1, -2/+6]
|     Makes a given manual seed generate the same images across different platforms, independently of the GPU architecture in use.
|     Fixes #9613.
* Refactor Mac specific code to a separate file [brkirch, 2023-02-01, 1, -45/+7]
|     Move most Mac related code to a separate file, don't even load it unless web UI is run under macOS.
* Refactor MPS fixes to CondFunc [brkirch, 2023-02-01, 1, -36/+14]
|
* MPS fix is still needed :( [brkirch, 2023-02-01, 1, -0/+3]
|     Apparently I did not test with large enough images to trigger the bug with torch.narrow on MPS
* Merge pull request #7309 from brkirch/fix-embeddings [AUTOMATIC1111, 2023-01-28, 1, -3/+8]
|\
| |     Fix embeddings, upscalers, and refactor `--upcast-sampling`
| * Remove MPS fix no longer needed for PyTorch [brkirch, 2023-01-28, 1, -3/+0]
| |     The torch.narrow fix was required for nightly PyTorch builds for a while to prevent a hard crash, but newer nightly builds don't have this issue.
| * Refactor conditional casting, fix upscalers [brkirch, 2023-01-28, 1, -0/+8]
| |