Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | flatten color_values | Ameer J | 2023-08-10 | 1 | -14/+9 |
| | |||||
* | flatten encoding_values | Ameer J | 2023-08-10 | 1 | -11/+10 |
| | |||||
* | flatten result vector | Ameer J | 2023-08-10 | 1 | -14/+5 |
| | |||||
* | GetUnquantizedWeightVector | Ameer J | 2023-08-09 | 1 | -69/+63 |
| | |||||
* | Compute Replicate | Ameer J | 2023-08-06 | 1 | -85/+20 |
| | |||||
* | minor | Ameer J | 2023-08-06 | 1 | -12/+6 |
| | |||||
* | undo uint | Ameer J | 2023-08-06 | 1 | -3/+3 |
| | |||||
* | Revert "vulkan dims specialization" | Ameer J | 2023-08-06 | 1 | -26/+16 |
| | | | | This reverts commit e6243058f2269bd79ac8479d58e55feec2611e9d. | ||||
* | vulkan dims specialization | ameerj | 2023-08-06 | 1 | -16/+26 |
| | |||||
* | small_block opt | Ameer J | 2023-08-06 | 1 | -4/+3 |
| | |||||
* | remove TexelWeightParams | Ameer J | 2023-08-06 | 1 | -46/+31 |
| | |||||
* | error/void extent funcs | Ameer J | 2023-08-06 | 1 | -48/+43 |
| | |||||
* | more packing | Ameer J | 2023-08-06 | 1 | -109/+109 |
| | |||||
* | Revert "uint result index" | Ameer J | 2023-08-06 | 1 | -1/+1 |
| | | | | This reverts commit 0e978786b5a8e7382005d8b1e16cfa12f3eeb775. | ||||
* | Revert "bfe instead of mod" | Ameer J | 2023-08-06 | 1 | -15/+13 |
| | | | | This reverts commit 86006a3b09e8a8c17d2ade61be76736a79e3f58a. | ||||
* | Revert "global endpoints" | Ameer J | 2023-08-06 | 1 | -36/+40 |
| | | | | This reverts commit d8f5bfd1df2b7469ef6abcee182aa110602d1751. | ||||
* | global endpoints | Ameer J | 2023-08-06 | 1 | -40/+36 |
| | |||||
* | bfe instead of mod | Ameer J | 2023-08-06 | 1 | -13/+15 |
| | |||||
* | uint result index | Ameer J | 2023-08-06 | 1 | -1/+1 |
| | |||||
* | amd opts | Ameer J | 2023-08-06 | 1 | -16/+13 |
| | |||||
* | const, pack result_vector and replicate tables, | Ameer J | 2023-08-06 | 1 | -227/+260 |
| | | | | undo amd opts | ||||
* | minor redundancy cleanup | Ameer J | 2023-08-06 | 1 | -12/+2 |
| | |||||
* | extractbits robustness | Ameer J | 2023-08-06 | 1 | -5/+8 |
| | |||||
* | reuse vectors memory | Ameer J | 2023-08-06 | 1 | -33/+17 |
| | |||||
* | EncodingData pack | Ameer J | 2023-08-06 | 1 | -44/+69 |
| | |||||
* | flattening | Ameer J | 2023-08-06 | 1 | -43/+44 |
| | |||||
* | weights refactor | Ameer J | 2023-08-06 | 1 | -26/+22 |
| | |||||
* | params.max_weight | Ameer J | 2023-08-06 | 1 | -5/+2 |
| | |||||
* | skip bits | Ameer J | 2023-08-06 | 1 | -9/+14 |
| | |||||
* | restrict | Ameer J | 2023-08-06 | 1 | -2/+2 |
| | |||||
* | general: fix spelling mistakes | Liam | 2023-03-12 | 1 | -1/+1 |
| | |||||
* | video_core: Modify astc texture decode error fill value | FengChen | 2022-09-15 | 1 | -1/+1 |
| | |||||
* | general: Convert source file copyright comments over to SPDX | Morph | 2022-04-23 | 1 | -3/+2 |
| | | | | | This formats all copyright comments according to SPDX formatting guidelines. Additionally, this resolves the remaining GPLv2 only licensed files by relicensing them to GPLv2.0-or-later. | ||||
* | astc_decoder: Combine FastReplicate functions to work around new NV driver bug | ameerj | 2022-01-16 | 1 | -34/+46 |
| | | | | | | The new Nvidia drivers have a bug where the FastReplicateTo6 function produces a lookup into the REPLICATE_TO_8 table rather than the REPLICATE_TO_6 table. This seems to be an optimization gone wrong. Combining the logic of the FastReplicate functions seems to address the bug. | ||||
* | astc_decoder: Reduce workgroup size | ameerj | 2021-08-01 | 1 | -1/+1 |
| | | | | This reduces the amount of over dispatching when there are odd dimensions (i.e. ASTC 8x5), which rarely evenly divide into 32x32. | ||||
* | astc_decoder: Compute offset swizzles in-shader | ameerj | 2021-08-01 | 1 | -33/+13 |
| | | | | Alleviates the dependency on the swizzle table and a uniform which is constant for all ASTC texture sizes. | ||||
* | astc_decoder: Make use of uvec4 for payload data | ameerj | 2021-08-01 | 1 | -79/+43 |
| | |||||
* | astc_decoder: Simplify Select2DPartition | ameerj | 2021-08-01 | 1 | -38/+19 |
| | |||||
* | astc_decoder: Optimize the use EncodingData | ameerj | 2021-08-01 | 1 | -25/+25 |
| | | | | | | | This buffer was a list of EncodingData structures sorted by their bit length, with some duplication from the cpu decoder implementation. We can take advantage of its sorted property to optimize its usage in the shader. Thanks to wwylele for the optimization idea. | ||||
* | astc_decoder.comp: Remove unnecessary LUT SSBOs | ameerj | 2021-06-19 | 1 | -19/+16 |
| | | | | We can move them to instead be compile time constants within the shader. | ||||
* | astc: Various robustness enhancements for the gpu decoder | ameerj | 2021-06-19 | 1 | -9/+6 |
| | | | | | | These changes should help in reducing crashes/drivers panics that may occur due to synchronization issues between the shader completion and later access of the decoded texture. | ||||
* | astc_decoder: Fix LDR CEM1 endpoint calculation | ameerj | 2021-06-16 | 1 | -1/+1 |
| | | | | | | | | Per the spec, L1 is clamped to the value 0xff if it is greater than 0xff. An oversight caused us to take the maximum of L1 and 0xff, rather than the minimum. Huge thanks to wwylele for finding this. Co-Authored-By: Weiyi Wang <wwylele@gmail.com> | ||||
* | astc_decoder: Refactor for style and more efficient memory use | ameerj | 2021-03-25 | 1 | -262/+307 |
| | |||||
* | astc_decoder: Reimplement Layers | Rodrigo Locatti | 2021-03-13 | 1 | -18/+15 |
| | | | | Reimplements the approach to decoding layers in the compute shader. Fixes multilayer astc decoding when using Vulkan. | ||||
* | astc_decoder: Fix out of bounds memory access | ameerj | 2021-03-13 | 1 | -2/+10 |
| | | | | resolves a crash with some anamolous textures found in Astral Chain. | ||||
* | renderer_vulkan: Accelerate ASTC decoding | ameerj | 2021-03-13 | 1 | -21/+22 |
| | | | | Co-Authored-By: Rodrigo Locatti <reinuseslisp@airmail.cc> | ||||
* | renderer_opengl: Accelerate ASTC texture decoding with a compute shader | ameerj | 2021-03-13 | 1 | -0/+1288 |
ASTC texture decoding is currently handled by a CPU decoder for GPU's without native ASTC decoding support (most desktop GPUs). This is the cause for noticeable performance degradation in titles which use the format extensively. This commit adds support to accelerate ASTC decoding using a compute shader on OpenGL for GPUs without native support. |