summaryrefslogtreecommitdiffstats
path: root/src/shader_recompiler/backend (follow)
Commit message (Collapse)AuthorAgeFilesLines
* shader_recompiler: use only ConstOffset for OpImageFetchLiam2024-02-052-38/+7
|
* Merge pull request #12644 from liamwhite/vkspec-image-offsetliamwhite2024-01-192-23/+55
|\ | | | | shader_recompiler: fix Offset operand usage for non-OpImage*Gather
| * shader_recompiler: fix Offset operand usage for non-OpImage*GatherLiam2024-01-112-23/+55
| |
* | Merge remote-tracking branch 'origin/master' into typos3Viktor Szépe2024-01-163-8/+86
|\ \
| * | shader_recompiler: emulate 8-bit and 16-bit storage writes with cas loopLiam2024-01-113-8/+86
| |/
* / Fix more typosViktor Szépe2024-01-161-2/+2
|/
* Merge pull request #12437 from ameerj/gl-amd-fixesliamwhite2024-01-043-8/+8
|\ | | | | OpenGL: Fixes and workaround updates for AMD
| * spirv_emit_context: Fix BaseInstance for OGL spirvAmeer J2023-12-221-1/+1
| |
| * emit_glsl_image: Use inlined texelFetch offsetsAmeer J2023-12-212-7/+7
| |
* | shader_recompiler: use default value for clip distances arrayLiam2023-12-271-7/+13
| |
* | shader_recompiler: respect clip distance limits in indexed storeLiam2023-12-271-18/+28
| |
* | shader_recompiler: use float image operations on load/store when requiredLiam2023-12-213-15/+31
| |
* | shader_recompiler: use minimal clip distance arrayLiam2023-12-191-1/+2
| |
* | shader_recompiler: ignore clip distances beyond driver support levelLiam2023-12-192-1/+6
|/
* Merge pull request #12196 from ameerj/glsl-cbuf-sizesliamwhite2023-12-031-1/+4
|\ | | | | GLSL: Use known cbuf sizes when possible
| * GLSL: Prefer known used cbuf sizesAmeer J2023-11-271-1/+4
| |
* | Merge branch 'master' into ssbo-alignAmeer J2023-11-276-36/+40
|\|
| * renderer_vulkan: ignore viewport stores on non-supporting driversLiam2023-11-192-1/+5
| |
| * shader_recompiler: Fix spelling of "derivate" (#12067)Ameer J2023-11-184-35/+35
| |
* | shader_recompiler: Align SSBO offsets in GlobalMemory functionsAmeer J2023-11-013-5/+13
|/
* Merge pull request #11847 from ameerj/glsl-shfl-fixliamwhite2023-10-232-4/+9
|\ | | | | emit_glsl_warp: Fix shfl_in_bounds conditional
| * emit_glsl_warp: Fix shfl_in_bounds conditionalAmeer J2023-10-222-4/+9
| |
* | Manually robust on Maxwell and earlierKelebek12023-10-192-3/+56
|/
* ci: fix new codespell errorsLiam2023-10-031-1/+1
|
* emit_spirv: fix incorrect use of descriptor index in image atomicsLiam2023-09-222-13/+9
|
* shader_recompiler: skip sampler for buffer textures (#11435)liamwhite2023-09-143-6/+2
|
* shader_recompiler: always declare image format for image buffersLiam2023-09-021-1/+6
|
* shader_recompiler: fix emulation of 3D textureGradLiam2023-09-022-3/+34
|
* Shader Recomnpiler: implement textuzreGrad 3D emulation constant propagationFernando Sahmkow2023-08-192-5/+26
|
* emit_glasm: Fix lmem size computationameerj2023-06-261-1/+1
|
* Remove memory allocations in some hot pathsKelebek12023-06-223-3/+3
|
* video_core: Enable support_descriptor_aliasing on Turnip, disable storage atomic otherwise.bunnei2023-06-031-0/+10
|
* Avoid using VectorExtractDynamic for subgroup mask on Adreno GPUsBilly Laws2023-06-031-1/+16
| | | | This crashes their shader compiler for some reason.
* Implement scaled vertex buffer format emulationBilly Laws2023-06-033-49/+72
| | | | These formats are unsupported by mobile GPUs so they need to be emulated in shaders instead.
* Fix Tears of the Kingdom flickering clouds and depths.Kelebek12023-05-111-6/+2
|
* Define SampleMask as an arrayKelebek12023-04-302-2/+4
|
* shader_recompiler: Use vector arithmetic rather than component-wise in ImageGatherSubpixelOffsetWollnashorn2023-04-081-18/+9
| | | | Should be more efficient and better readable
* shader_recompiler: Add subpixel offset for correct rounding at `ImageGather`Wollnashorn2023-04-082-0/+68
| | | | | | | On AMD a subpixel offset of 1/512 of the texel size is applied to the texture coordinates at a ImageGather call to ensure the rounding at the texel centers is done the same way as in Maxwell or other Nvidia architectures. See https://www.reedbeta.com/blog/texture-gathers-and-coordinate-precision/ for more details why this might be necessary. This should fix shadow artifacts at object edges in Zelda: Breath of the Wild (#9957, #6956).
* general: fix spelling mistakesLiam2023-03-122-41/+41
|
* glsl_emit_context: Remove redeclarations of gl_SampleID and gl_SampleMaskameerj2023-02-091-6/+0
| | | | These built-ins seem to be available without needing to be declared for fragment shaders, similar i.e. to gl_FragDepth
* spirv: Fix TXQ with MSAA texturesameerj2023-01-293-8/+19
|
* emit_glasm_image: Fix TXQ with MSAA texturesameerj2023-01-291-1/+9
|
* emit_glsl_image: Implement TXQ with MSAA texturesameerj2023-01-291-9/+23
| | | | Also fixes for texture buffers, which do not have mips eithers.
* Merge pull request #9694 from ameerj/txq-mipsliamwhite2023-01-296-16/+19
|\ | | | | shader_recompiler: TXQ: Skip QueryLevels when possible
| * shader_recompiler: TXQ: Skip QueryLevels when possibleameerj2023-01-286-16/+19
| |
* | Merge pull request #9687 from ameerj/ogl-shader-msbunnei2023-01-294-33/+46
|\ \ | | | | | | glasm, glsl: Implement multisampled Image Fetch
| * | emit_glsl_image: Fix ImageFetch for MSAA texturesameerj2023-01-281-6/+11
| | |
| * | glasm: Add MS sampler typesameerj2023-01-272-5/+8
| | |
| * | glsl: Add MS sampler typesameerj2023-01-271-22/+27
| |/
* | Merge pull request #9682 from ameerj/shader-s32bunnei2023-01-286-15/+0
|\ \ | |/ |/| shader_recompiler: Remove S32 IR type
| * shader_recompiler: Remove S32 IR typeameerj2023-01-266-15/+0
| | | | | | | | | | | | The frontend IR opcodes do not distinguish between signed and unsigned integer types. Fixes broken shaders when IR validation/graphics debugging is enabled for shaders that used BitCastS32F32
* | spirv: fix multisampled image fetchLiam2023-01-232-2/+7
|/
* Run clang-formatBilly Laws2023-01-052-7/+9
|
* shader_recompiler: Fix shuffle partitioning for >64 invoc-per-subgroup GPUsBilly Laws2023-01-051-30/+28
| | | | The existing implementation only supports 64 invoc-per-subgroup GPUs, and misbehaves on adreno when invocations need to be split into 4 emulated subgroups.
* shader_recompiler: SPIRV: Only enable int64 feature when supportedBilly Laws2023-01-051-1/+1
|
* Vulkan: Add a workaround for input_position on Adreno driversBilly Laws2023-01-053-11/+39
| | | | Adreno drivers will crash compiling geometry shaders if the input position is not wrapped in a gl_in struct.
* Video_core: Address feedbackFernando Sahmkow2023-01-045-0/+32
|
* MacroHLE: Add OpenGL SupportFernando Sahmkow2023-01-012-1/+13
|
* MacroHLE: Add HLE replacement for base vertex and base instance.Fernando Sahmkow2023-01-012-0/+18
|
* Merge pull request #7450 from FernandoS27/ndc-vulkanliamwhite2022-12-172-3/+4
|\ | | | | Vulkan: Add support for VK_EXT_depth_clip_control.
| * Vulkan: Add support for VK_EXT_depth_clip_control.FernandoS272022-12-142-3/+4
| |
* | spirv_emit_context: declare GroupNonUniform capability for SubgroupLocalInvocationIdLiam2022-12-141-0/+2
|/
* Vulkan: update initializationLiam2022-11-273-19/+31
| | | | Co-authored-by: bylaws <bylaws@users.noreply.github.com>
* spirv_emit_context: add missing flat decorationLiam2022-11-191-0/+1
|
* Merge pull request #9253 from vonchenplus/attr_layerliamwhite2022-11-195-0/+13
|\ | | | | shader: Implement miss attribute layer
| * shader: Implement miss attribute layerFengChen2022-11-175-0/+13
| |
* | video_core: Fix few issues in Tess stageFengChen2022-11-079-0/+48
|/
* video_core: Fix SNORM texture buffer emulating error (#9001)Feng Chen2022-11-046-1/+16
|
* Merge pull request #8858 from vonchenplus/mipmapbunnei2022-11-0411-1/+67
|\ | | | | video_core: Generate mipmap texture by drawing
| * video_core: Generate mipmap texture by drawingFengChen2022-09-2011-1/+67
| |
* | CMakeLists: Disable C4100 and C4324Morph2022-10-222-8/+0
| | | | | | | | Disabling C4100 is similar to -Wno-unused-parameter
* | shader_recompiler: add extended LDC to GLASM backendLiam2022-10-021-4/+21
| |
* | chore: fix some typosAndrea Pappacoda2022-09-232-2/+2
|/ | | | Fix some typos reported by Lintian
* video_code: support rectangle textureFengChen2022-08-255-0/+6
|
* Add missed shader defines. Fixes Xenoblade Chronicles 3 booting with Vulkan.Kelebek12022-07-291-2/+3
|
* general: Convert source file copyright comments over to SPDXMorph2022-04-2373-219/+146
| | | | | This formats all copyright comments according to SPDX formatting guidelines. Additionally, this resolves the remaining GPLv2 only licensed files by relicensing them to GPLv2.0-or-later.
* shader_compiler: support const buffer indirect addressing in GLSLLiam2022-04-013-8/+37
|
* shader_recompiler: support const buffer indirect addressing on OpenGL SPIR-VLiam2022-04-011-10/+11
|
* Include <bit> header when std::count{r,l}_zero is usedBilly Laws2022-03-223-0/+4
| | | | Needed for compilation with older libc++ releases
* shader_recompiler: Reduce unused includesameerj2022-03-2029-45/+0
|
* Address review commentsLiam2022-03-172-19/+7
|
* shader_recompiler: Use functions for indirect const buffer accessesLiam2022-03-173-39/+91
|
* Address review commentsLiam2022-03-171-16/+15
|
* shader: add support for const buffer indirect addressingLiam2022-03-151-10/+41
|
* emit_spirv, vk_compute_pass: Resolve VS2022 compiler errorsameerj2022-03-121-1/+1
|
* emit_glsl_atomic: Implement 32x2 fallback atomic opsameerj2022-01-301-9/+55
|
* shaders: Add U64->U32x2 Atomic fallback functionsameerj2022-01-306-1/+413
|
* spirv_atomic: Define U32x2 storage buffers for 64-bit storage atomicsameerj2022-01-291-2/+2
| | | | | | Some drivers do not support 64-bit atomics, and fallback to atomically modifying U32x2 vectors. This change ensures that U32x2 storage vectors are defined in the spir-v shader when 64-bit atomics are used. Fixes a hang on some devices, notably Intel GPUs, when booting Pokemon Legends Arceus
* emit_spirv: Add Xfb execution mode when transform feedback is usedameerj2022-01-281-3/+9
| | | | Fixes Transform Feedback on Vulkan AMD drivers.
* shader_recompiler: fix potential OOB accessv19932022-01-172-6/+8
| | | | Found by static analysis with PVS-Studio. Original check wasn't actually checking for OOB and would segfault in case of it.
* glsl: Remove unreachable returnNarr the Reg2022-01-051-1/+0
|
* Merge pull request #7629 from ameerj/nv-driver-fixesFernando S2022-01-0311-23/+96
|\ | | | | shaders: Add fixes for NVIDIA drivers 495+
| * glsl: Add boolean reference workaroundameerj2021-12-302-2/+6
| |
| * glsl_context_get_set: Add alternative cbuf type for broken driversameerj2021-12-302-17/+25
| | | | | | | | some drivers have a bug bitwise converting floating point cbuf values to uint variables. This adds a workaround for these drivers to make all cbufs uint and convert to floating point as needed.
| * emit_glsl_integer: Use negation work aroundameerj2021-12-301-2/+2
| |
| * shader: Add integer attribute get optimization passameerj2021-12-306-0/+61
| | | | | | | | Works around an nvidia driver bug, where casting the integer attributes to float and back to an integer always returned 0.
| * emit_glsl_floating_point: Fix FPNeg on newer Nvidia driversameerj2021-12-251-2/+2
| |
* | Merge pull request #7618 from goldenx86/patch-4bunnei2021-12-291-0/+9
|\ \ | | | | | | Increase boost requirement to 1.78.0
| * | Empty spacesMatías Locatti2021-12-281-1/+1
| | |
| * | Changes to avoid warnings in SSE4.2 optimized SPIR-VMatías Locatti2021-12-281-0/+9
| |/
* / emit_glasm_context_get_set: Fix GetAttribute return value type.ameerj2021-12-251-4/+4
|/ | | | GetAttribute expects an F32 result type at the IR level, this fixes the return value of attributes which were not returning an F32
* Remove spirv handle legacy related codevonchenplus2021-12-183-189/+0
|
* Remove glsl handle legacy related codevonchenplus2021-12-183-103/+1
|
* emit_spirv: Reduce emit_spirv.h include overheadameerj2021-12-0620-3/+20
| | | | emit_spirv.h is included in video_core, which was propagating further includes that video_core did not depend on.
* glasm: Move implemented instructions from not_implemented.cppameerj2021-12-067-169/+220
|
* shader_recompiler: Adjust emit_context includesameerj2021-12-0637-37/+37
|
* shader_recompiler: Rename backend emit_context filesameerj2021-12-056-0/+0
|
* vulkan: Fix rescaling push constant usageameerj2021-11-164-34/+36
|
* shader, video_core: Fix GCC build errorsameerj2021-11-161-4/+0
|
* emit_spirv: Fix RescalingLayout alignmentameerj2021-11-161-0/+1
|
* emit_spirv: Fix RescalingLayout alignmentameerj2021-11-161-2/+2
|
* shader: Properly scale image reads and add GL SPIR-V supportReinUsesLisp2021-11-1614-47/+163
| | | | Thanks for everything!
* glsl/glasm: Pass and use scaling parameters in shadersReinUsesLisp2021-11-166-7/+11
|
* gl_graphics_pipeline: Add downscale factor to shader uniformsameerj2021-11-163-4/+5
|
* spirv: Implement rescaling patchingReinUsesLisp2021-11-165-5/+72
|
* shader: Add IsTextureScaled opcodeReinUsesLisp2021-11-166-0/+24
|
* shader: Add integer division opcodesReinUsesLisp2021-11-166-0/+30
|
* shader: Add resolution down factor opcodeReinUsesLisp2021-11-166-0/+18
|
* Merge pull request #7260 from vonchenplus/spirv_support_legacy_attribute_v2bunnei2021-11-143-71/+153
|\ | | | | shader: Spirv support legacy attribute v2
| * Simply legacy attribute implementFeng Chen2021-11-043-152/+125
| |
| * Support gl_FogFragCoord attributevonchenplus2021-10-313-48/+58
| |
| * Support gl_BackSecondaryColor attributevonchenplus2021-10-263-0/+33
| |
| * Support gl_FrontSecondaryColor attributevonchenplus2021-10-263-0/+33
| |
| * Support gl_BackColor attributevonchenplus2021-10-263-0/+33
| |
* | Merge pull request #7262 from FernandoS27/Buffalo-buffalo-Buffalo-buffalo-buffalobunnei2021-11-032-3/+3
|\ \ | | | | | | ShaderCache: Order Phi Arguments from farthest away to nearest.
| * | Shader Cahe: Fix Phi Nodes on GLASM.Fernando Sahmkow2021-11-021-1/+1
| | |
| * | ShaderCache: Fix Phi Nodes Type on OGL.Fernando Sahmkow2021-11-011-2/+2
| |/
* / emit_spirv_image: Fix depth image implicit lod sample in computeameerj2021-10-171-5/+16
|/ | | | Ensures all drivers behave the same way in this case.
* style: Remove extra space preceding the :: operatorMorph2021-09-291-2/+2
|
* Spir-V: Rescale the frag depth to 0,1 mode when -1,1 mode is used in Vulkan.Fernando Sahmkow2021-09-151-1/+7
|
* Merge pull request #6948 from ameerj/amd-warp-fixMorph2021-09-122-54/+109
|\ | | | | shaders: Fix warp instructions on 64-thread warp devices
| * emit_glsl_warp: Fix shuffle ops for 64-thread warp sizesameerj2021-08-311-24/+36
| |
| * emit_glsl_warp: Fix ballot related ops for 64-thread warp sizesameerj2021-08-311-24/+38
| |
| * emit_spirv_warp: Fix shuffle ops for 64-thread warp sizesameerj2021-08-311-1/+29
| |
| * emit_spirv_warp: Fix ballot related ops for 64-thread warp sizesameerj2021-08-311-10/+11
| |
* | Merge pull request #6962 from vonchenplus/spirv_support_legacy_attributebunnei2021-09-083-0/+107
|\ \ | |/ |/| renderer_vulkan: Spirv support glsl legacy attribute
| * Detail adjustmentFeng Chen2021-09-081-13/+14
| |
| * Detail adjustmentFeng Chen2021-09-082-28/+35
| |
| * Re-implement get unused locationFeng Chen2021-09-071-30/+30
| |
| * Move attribute related definitions to spirv anonymous namespaceFeng Chen2021-09-072-5/+26
| |
| * Dynamic get unused locationFeng Chen2021-09-061-27/+49
| |
| * Implement intput and output fixed fnc texturesFeng Chen2021-09-063-19/+23
| |
| * Rename parametersFeng Chen2021-09-033-9/+9
| |
| * Fix create GraphicsPipelines crashFeng Chen2021-09-031-5/+5
| |
| * Add input/output locationFeng Chen2021-09-021-5/+13
| |
| * Add colorfront and txtcoord supportFeng Chen2021-08-313-0/+44
| |
* | emit_spirv_context_get_set: Fix Get FrontFace return valueameerj2021-08-271-2/+3
|/ | | | The IR expects GetAttribute to return an F32 value. This case was returning a U32 instead.
* SPIR-V: Merge two ifs in EmitGetAttributeValeri2021-08-191-6/+2
|
* Merge pull request #6726 from lioncash/hguardRodrigo Locatti2021-07-261-0/+2
|\ | | | | emit_spirv_instructions: Add missing header guard
| * emit_spirv_instructions: Add missing header guardLioncash2021-07-261-0/+2
| |
* | emit_glasm: Fix LINESS_ADJACENCY typo in InputPrimitive()Lioncash2021-07-261-1/+1
|/ | | | This should be LINES_ADJACENCY
* shader: Fix disabled attribute default valuesameerj2021-07-231-1/+1
|
* glsl: Simplify FCMP emissionameerj2021-07-231-6/+4
|
* glsl: Update TessellationControl gl_inameerj2021-07-231-0/+28
| | | | Adheres to GL_ARB_separate_shader_objects requirements
* shader: Avoid usage of C++20 ranges to build in clangReinUsesLisp2021-07-232-4/+12
|
* glsl: Clamp shared mem size to GL_MAX_COMPUTE_SHARED_MEMORY_SIZEameerj2021-07-231-2/+9
|
* shader_recompiler, video_core: Resolve clang errorslat9nq2021-07-234-9/+13
| | | | | | | | | | Silences the following warnings-turned-errors: -Wsign-conversion -Wunused-private-field -Wbraced-scalar-init -Wunused-variable And some other errors
* shader: Ignore global memory ops on devices lacking int64 supportameerj2021-07-234-16/+62
|
* emit_spirv: Workaround VK_KHR_shader_float_controls on fp16 NvidiaReinUsesLisp2021-07-231-5/+8
| | | | Fix regression on Fire Emblem: Three Houses when using native fp16.
* shader: GCC fmt 8.0.0 fixeslat9nq2021-07-235-11/+14
|
* shader: Fix disabled and unwritten attributes and varyingsReinUsesLisp2021-07-232-3/+11
|
* glsl: Fix shared and local memory declarationsameerj2021-07-231-3/+3
| | | | account for the fact that program.*memory_size is in units of bytes.
* opengl: Implement LOP.CCameerj2021-07-232-6/+38
| | | | Used by MH:Rise
* spirv: Fix code emission when descriptor aliasing is unsupportedReinUsesLisp2021-07-231-1/+2
| | | | Fixes OpenGL.
* glsl: Declare local memory in mainameerj2021-07-231-3/+3
|
* glsl: Add passthrough geometry shader supportameerj2021-07-233-7/+27
|
* glasm: Add passthrough geometry shader supportReinUsesLisp2021-07-232-7/+26
|
* shader: Rework varyings and implement passthrough geometry shadersReinUsesLisp2021-07-2311-97/+116
| | | | | | Put all varyings into a single std::bitset with helpers to access it. Implement passthrough geometry shaders using host's.
* glsl: Better IAdd Overflow CC fixameerj2021-07-232-11/+13
| | | | This ensures the original operand values are not overwritten when being used in the overflow detection.
* shader: Remove IAbs64ReinUsesLisp2021-07-236-15/+0
|
* glsl: Fix IADD CCameerj2021-07-232-5/+7
|
* shader: Move loop safety tests to code emissionReinUsesLisp2021-07-2311-32/+48
|
* glsl: Remove frag color initializationameerj2021-07-231-9/+0
|
* glasm: Implement SetAttribute ViewportMaskameerj2021-07-232-1/+10
|
* emit_glsl_special: Skip initialization of frag_color0ameerj2021-07-231-1/+1
| | | | Fixes rendering in Devil May Cry without regressing Ori and the Blind Forest.
* glsl: Add missing ; in EmitSetSampleMaskMorph2021-07-231-1/+1
| | | | Fixes shader compilation in Okami HD
* glsl: Fix output varying initialization when transform feedback is usedameerj2021-07-231-3/+37
|
* spirv: Properly handle devices without int8 and int16ReinUsesLisp2021-07-232-39/+67
|
* spirv: Handle small storage buffer loads on devices with no supportReinUsesLisp2021-07-232-6/+6
|
* glsl: Fix cbuf component indexing bug falbackameerj2021-07-231-7/+6
|
* shader: Properly manage attributes not written from previous stagesReinUsesLisp2021-07-236-30/+26
|
* glsl: Only declare fragment outputs on fragment shadersReinUsesLisp2021-07-231-4/+6
|
* shader: Split profile and runtime info headersReinUsesLisp2021-07-238-4/+7
|
* glsl: Obey need_declared_frag_colors to declare and initialize all frag_colorameerj2021-07-232-1/+10
| | | | Fixes Ori and the blind forest title screen
* glsl: Address rest of feedbackameerj2021-07-234-19/+51
|
* glsl: Move gl_Position/generic attribute initialization to EmitProlgueameerj2021-07-232-14/+12
|
* glsl: Conditionally use fine/coarse derivatives based on device supportameerj2021-07-232-4/+27
|
* glsl: Cleanup/Address feedbackameerj2021-07-238-28/+20
|
* glsl: Add Shader_GLSL loggingameerj2021-07-233-28/+32
|
* glsl: Add LoopSafety instructionsameerj2021-07-232-0/+10
|
* glsl: Conditionally add EXT_texture_shadow_lodameerj2021-07-231-3/+5
|
* glsl: Add stubs for sparse queries and variable aoffi when not supportedameerj2021-07-233-13/+37
|
* glsl: Implement legacy varyingsameerj2021-07-234-7/+71
|
* glsl: Minor cleanupameerj2021-07-232-19/+15
|
* glsl: Fix Cbuf getters for F32 typeameerj2021-07-231-12/+15
|
* glsl: Add immediate index oob checking for Cbuf gettersameerj2021-07-231-0/+16
|
* glsl: Refactor GetCbuf functions to reduce code duplicationameerj2021-07-231-104/+66
|
* glsl: Address more feedback. Implement indexed texture readsameerj2021-07-235-111/+109
|
* glsl: Remove Signed Integer variablesameerj2021-07-238-43/+13
|
* glsl: Address Rodrigo's feedbackameerj2021-07-2313-75/+87
|
* glsl: Reorganize backend code, remove unneeded [[maybe_unused]]ameerj2021-07-2311-314/+251
|
* glsl: Implement SampleId and SetSampleMaskameerj2021-07-233-30/+35
| | | | plus some minor refactoring of implementations
* glsl: Add gl_PerVertex in for GSameerj2021-07-231-1/+2
|
* glsl: Use existing tracking for enabling EXT_shader_image_load_formattedameerj2021-07-231-15/+1
|
* glsl: Enable early fragment testsameerj2021-07-232-4/+7
|
* glsl: Implement more attribute getters and settersameerj2021-07-232-12/+60
|
* glsl: Implement fswzaddameerj2021-07-233-5/+43
| | | | and wip nv thread shuffle impl
* glsl: Implement indexed attribute loadsameerj2021-07-235-29/+64
|
* glsl: Conditionally add GL_ARB_sparse_texture2ameerj2021-07-231-2/+3
|
* glsl: Conditionally use GL_EXT_shader_image_load_formattedameerj2021-07-231-2/+18
| | | | Fix for SULD.D
* glsl: Remove output generic indexing for geometry stageameerj2021-07-231-5/+3
|
* glsl: Allow dynamic tracking of variable allocationameerj2021-07-233-21/+35
|
* glsl: Implement barriersameerj2021-07-233-13/+21
|
* glsl: Implement image atomics and set layerameerj2021-07-234-152/+202
| | | | along with some more cleanup/oversight fixes
* glsl: Fix image gather logicameerj2021-07-231-0/+4
|
* glsl: Add cbuf access workaround for devices with component indexing bugameerj2021-07-231-51/+110
|
* glsl: Use textureGrad fallback when EXT_texture_shadow_lod is unsupportedameerj2021-07-232-8/+40
|
* emit_glsl_image: Use immediate offsets when possibleameerj2021-07-231-12/+33
|
* glsl: Fix <32-bit SSBO writesameerj2021-07-234-50/+43
| | | | and more cleanup
* glsl: Cleanup and address feedbackameerj2021-07-2310-86/+69
|
* glsl: Refactor Global memory functionsameerj2021-07-232-71/+73
|
* glsl: Increase NUM_VARS that can be allocatedameerj2021-07-231-1/+1
| | | | needed for HW:AoC.
* glsl: Implement Load/WriteGlobalameerj2021-07-239-98/+185
| | | | along with some other misc changes and fixes
* glsl: Implement Imagesameerj2021-07-232-9/+74
|
* glsl: skip gl_ViewportIndex write if device does not support itameerj2021-07-233-8/+16
|
* glsl: Implement transform feedbackameerj2021-07-233-13/+63
|
* glsl: Yet another gl_ViewportIndex fix attemptameerj2021-07-231-3/+19
|
* glsl: Add gl_ViewportIndex out attributeameerj2021-07-231-1/+3
|
* emit_glsl_context_get_set: Remove unused functionlat9nq2021-07-231-4/+0
|
* glsl: Fix precise variable declarationameerj2021-07-233-24/+25
| | | | and add some more separation in the shader for better debugability when dumped
* glsl: Implement tessellation shadersameerj2021-07-235-27/+146
|
* glsl: Implement ImageGradient and other texture function variantsameerj2021-07-232-32/+73
|
* glsl: Fix atomic SSBO offsetsameerj2021-07-234-67/+74
| | | | and implement misc getters
* glsl: Implement geometry shadersameerj2021-07-234-9/+62
|
* glsl: Use NotImplemented macro with function name outputameerj2021-07-2310-104/+103
|
* glsl: Implement gl_ViewportIndexameerj2021-07-233-5/+14
| | | | SSBU now working
* glsl: SHFL fix and prefer shift operations over divide in glsl shaderameerj2021-07-235-63/+64
|
* glsl: Implement precise fp variable allocationameerj2021-07-234-8/+67
|
* HACK glsl: Write defaults to unused generic attributesameerj2021-07-232-2/+11
|
* glsl: Fix ssbo indexing and name shadowing between shader stagesameerj2021-07-233-77/+101
|
* glsl: implement set clip distanceameerj2021-07-232-0/+15
| | | | and missed a diff in emit_glsl relating to var alloc ref counting
* glsl: Rework var alloc to not assign unused resultsameerj2021-07-239-49/+91
|
* glsl: Rework variable allocator to allow for variable reuseameerj2021-07-2313-351/+480
|
* glsl: Fix ATOM and implement ATOMSameerj2021-07-234-114/+130
|
* glsl: Use gl_SubGroupInvocationARBameerj2021-07-232-8/+7
|
* glsl: Implement VOTE for subgroup size potentially largerameerj2021-07-232-19/+36
|
* glsl: Implement VOTEameerj2021-07-234-50/+64
|
* glsl: Implement ST{LS}ameerj2021-07-236-69/+106
|
* glsl: Implement more instructions used by SMOameerj2021-07-231-3/+3
|
* glsl: Implement more instructions used by SMOameerj2021-07-235-10/+16
|
* glsl: Fix GetAttribute return valuesameerj2021-07-232-7/+9
| | | | fixes font rendering issues as these were used to index into the ssbos
* glsl: minor cleanupameerj2021-07-234-20/+19
|
* glsl: Fix and implement rest of cbuf accessameerj2021-07-231-7/+43
|
* glsl: Implement TXQ and other misc changesameerj2021-07-235-6/+36
|
* glsl: TLD4 implementationameerj2021-07-231-2/+89
|
* glsl: Implement TLD instructionameerj2021-07-231-1/+55
|
* glsl: Implement TEXSameerj2021-07-231-1/+29
|
* glsl: Cleanup texture functionsameerj2021-07-231-13/+11
|
* shader_recompiler: GCC fixeslat9nq2021-07-2314-3/+13
|
* glsl: Implement TEX depth functionsameerj2021-07-232-4/+46
|
* glsl: Implement TEX ImageSample functionsameerj2021-07-233-11/+71
|
* glsl: Rework Shuffle emit instructions to align with SPIR-Vameerj2021-07-231-19/+40
|
* glsl: Better Storage access and wip warpsameerj2021-07-238-62/+133
|
* glsl: Fix integer conversions, implement clamp CCameerj2021-07-232-27/+36
|
* glsl: Implement IADD CCameerj2021-07-232-2/+17
|
* glsl: SSBO access fixes and wip SampleExplicitLod implementation.ameerj2021-07-232-4/+19
|
* glsl: WIP var forward declarationameerj2021-07-236-49/+60
| | | | to fix Loop control flow.
* glsl: Fix bindings, add some CC opsameerj2021-07-238-57/+91
|
* glsl: remove unused headersameerj2021-07-2314-34/+10
|
* glsl: Implement derivatives and YDirectionameerj2021-07-238-81/+87
| | | | plus some other misc additions/changed
* glsl: Fix non-immediate buffer accessameerj2021-07-2312-72/+133
| | | | and many other misc implementations
* glsl: textures wipameerj2021-07-239-75/+139
|
* glsl: Implement some attribute getters and settersameerj2021-07-239-191/+337
|
* glsl: Track S32 atomicsameerj2021-07-231-3/+6
|
* glsl: Update phi node managementameerj2021-07-234-21/+53
|
* glsl: Fix floating point compare opsameerj2021-07-231-28/+28
| | | | Logic for ordered/unordered ops was wrong.
* glsl: Query GL Device for FP16 extension supportameerj2021-07-231-2/+7
|
* glsl: Simply FP storage atomicsameerj2021-07-232-48/+28
|
* glsl: F16x2 storage atomicsameerj2021-07-237-58/+64
|
* glsl: Revert ssbo aliasing. Storage Atomics implameerj2021-07-234-68/+132
|
* glsl: implement phi nodesameerj2021-07-234-20/+54
|
* glsl: Wip storage atomic opsameerj2021-07-238-325/+406
|
* glsl: Implement FCMPameerj2021-07-233-242/+185
|
* glsl: Add a more robust fp formatterameerj2021-07-234-9/+14
|
* glsl: More FP fixesameerj2021-07-232-9/+16
|
* glsl: FP function fixesameerj2021-07-237-17/+25
|
* glsl: More FP instructions/fixesameerj2021-07-235-28/+41
|
* glsl: Add many FP32/64 instructionsameerj2021-07-2312-765/+1011
|
* glsl: Implement more Integer opsameerj2021-07-233-119/+72
|
* glsl: Implement BF*ameerj2021-07-233-9/+10
|
* glsl: Implement a few Integer instructionsameerj2021-07-2310-260/+398
|
* glsl: Use std::string_view for Emit function args.ameerj2021-07-236-760/+838
|
* glsl: Pass IR::Inst& to Emit functionsameerj2021-07-236-171/+169
|
* glsl: INeg and IAdd negate testsameerj2021-07-233-94/+106
|
* glsl: Reusable typed variables. IADD32ameerj2021-07-236-203/+311
|
* glsl: Fix program linking and cbufameerj2021-07-232-3/+5
|
* glsl: Fix "reg" allocingameerj2021-07-2310-898/+938
| | | | based on glasm with some tweaks
* glsl: Initial backendameerj2021-07-2326-0/+3266
|
* spirv: Reduce log severity of mismatching denorm rulesReinUsesLisp2021-07-231-2/+2
|
* shader: Add loggingReinUsesLisp2021-07-239-21/+23
|
* shader: Add shader loop safety check settingslat9nq2021-07-234-8/+28
| | | | Also add a setting for enable Nsight Aftermath.
* spirv/convert: Catch more signed operations oversightsameerj2021-07-231-5/+5
| | | | The sign bit on integers of size < 32 was not properly preserved in casts
* spirv/convert: Catch more broken signed operations on Nvidia OpenGLReinUsesLisp2021-07-231-0/+6
| | | | | BitCast U32 to S32 before converting to float on drivers with broken signed operations.
* glasm: Use ARB_derivative_control conditionallyReinUsesLisp2021-07-232-7/+29
|
* emit_glasm_context_get_set: Remove unused variablelat9nq2021-07-231-1/+0
|
* shader,glasm: Implement legacy texcoord loadsReinUsesLisp2021-07-231-2/+10
|
* glasm: Implement legacy varyingsReinUsesLisp2021-07-231-17/+56
|
* shader: Add support for "negative" and unaligned offsetsReinUsesLisp2021-07-232-6/+11
| | | | | | | | | "Negative" offsets don't exist. They are shown as such due to a bug in nvdisasm. Unaligned offsets have been proved to read the aligned offset. For example, when reading an U32, if the offset is 6, the offset read will be 4.
* spirv: Fix output generics with componentsReinUsesLisp2021-07-231-1/+1
|
* opengl: Declare fragment outputs even if they are not usedReinUsesLisp2021-07-233-10/+5
| | | | | | Fixes Ori and the Blind Forest's menu on GLASM. For some reason (probably high level optimizations) it is not sanitized on SPIR-V for OpenGL. Vulkan is unaffected by this change.
* spirv: Fix image and image buffer descriptor index usageReinUsesLisp2021-07-231-5/+7
|
* glasm: Fix immediate texture coordinateReinUsesLisp2021-07-231-0/+1
|
* glasm: Reduce reg allocation leaks from an exception to a logReinUsesLisp2021-07-231-1/+1
|
* glasm: Use integer lod for TXQReinUsesLisp2021-07-232-2/+2
|
* glasm: Fix global memory fallbacksReinUsesLisp2021-07-231-9/+10
|
* Revert "glasm: Skip phi moves on undefined instructions"ReinUsesLisp2021-07-231-3/+1
| | | | Causes regressions on Bowser's Fury.
* glasm: Remove unintentional '\n' on Undef32ReinUsesLisp2021-07-231-1/+1
|
* glasm: Use storage buffers instead of global memory when possibleReinUsesLisp2021-07-234-369/+380
|
* glasm: Implement Y directionReinUsesLisp2021-07-234-3/+9
|
* glasm: Skip phi moves on undefined instructionsReinUsesLisp2021-07-231-1/+3
|
* glasm: Implement undef instructionsReinUsesLisp2021-07-232-15/+15
|
* glasm: Fix global memory callbacksReinUsesLisp2021-07-231-5/+6
|
* glasm: Release phi node registers after they are no longer neededReinUsesLisp2021-07-232-38/+54
|
* glasm: Fix INeg32 on negative immediatesReinUsesLisp2021-07-231-1/+5
|
* glasm: Remove unnecessary value typesReinUsesLisp2021-07-233-47/+6
|
* glasm: Throw when there are register leaksReinUsesLisp2021-07-232-0/+7
|
* glasm: Catch more register leaksReinUsesLisp2021-07-238-41/+114
| | | | | | | | | | | | | Add support for null registers. These are used when an instruction has no usages. This comes handy when an instruction is only used for its CC value, with the caveat of having to invalidate all pseudo-instructions before defining the instruction itself in the register allocator. This commits changes this. Workaround a bug on Nvidia's condition codes conditional execution using branches.
* glasm: Fix usage counting on phi nodesReinUsesLisp2021-07-233-8/+22
|
* glasm: Implement global memory fallbacksReinUsesLisp2021-07-232-50/+89
|
* glasm: Implement int64 add and subtractReinUsesLisp2021-07-232-8/+6
|
* emit_glasm_context_get_set: Remove unused variablelat9nq2021-07-231-1/+0
|
* glasm: Implement indirect attribute loadsReinUsesLisp2021-07-234-6/+65
|
* glasm: Implement image atomicsReinUsesLisp2021-07-232-165/+153
|
* glasm: Reorder unreachable image atomic instsReinUsesLisp2021-07-231-66/+66
| | | | Reorder them to the bottom of the file for readability.
* glasm: Implement gl_Layer storesReinUsesLisp2021-07-231-0/+7
|
* glasm: Implement SampleIdReinUsesLisp2021-07-232-3/+3
|
* glasm: Implement IsHelperInvocationReinUsesLisp2021-07-232-3/+3
|
* glasm: Fix EmitVertex's optimizationReinUsesLisp2021-07-231-1/+1
|
* gl_shader_cache,glasm: Conditionally use typeless image reads extensionReinUsesLisp2021-07-231-2/+4
|
* glasm: Implement forced early ZReinUsesLisp2021-07-231-2/+6
|
* glasm: Simplify patch readsReinUsesLisp2021-07-231-5/+2
|
* glasm: Fix output patch readsReinUsesLisp2021-07-232-13/+22
| | | | With this, Luigi's Mansion's sand renders properly.
* shader: Split profile and runtime information in separate structsReinUsesLisp2021-07-2310-66/+80
|
* emit_glasm_context_get_and_set.cpp: Add missing semicolonsameerj2021-07-231-2/+2
|
* glasm: Fix patch attribute declarationsReinUsesLisp2021-07-231-1/+1
|
* glasm: Implement FSWZADDameerj2021-07-233-4/+28
|
* glasm: Implement PrimitiveId attribute readReinUsesLisp2021-07-231-0/+3
|
* glasm: Implement clip distance storesReinUsesLisp2021-07-232-0/+15
|
* glasm: Fix tessellation input attributesReinUsesLisp2021-07-231-2/+5
|
* glasm: Add missing semicolon on tesscoord readingReinUsesLisp2021-07-231-1/+1
|
* glasm: Fix tessellation headersReinUsesLisp2021-07-231-2/+2
|
* glasm: Add tessellation shader declarationsReinUsesLisp2021-07-231-0/+35
|
* glasm: Implement TessellationEvaluationPointReinUsesLisp2021-07-231-0/+4
|
* glasm: Implement patch memoryReinUsesLisp2021-07-233-6/+51
|
* glasm: Fix InvocationId declarationReinUsesLisp2021-07-231-1/+1
|
* glasm: Implement InvocationIdReinUsesLisp2021-07-232-2/+5
|
* glasm: Optimize EmitVertex into EMITReinUsesLisp2021-07-231-1/+5
|
* glasm: Implement geometry shader attribute readsReinUsesLisp2021-07-232-4/+18
|
* glasm: Properly declare attributes on geometry programsReinUsesLisp2021-07-233-6/+14
|
* glasm: Declare geometry program headersReinUsesLisp2021-07-231-0/+35
|
* glasm: Fix potential aliasing bug on cube array samplesReinUsesLisp2021-07-232-35/+44
|
* glasm: Implement ImageWriteReinUsesLisp2021-07-231-4/+7
|
* glasm: Implement ImageReadReinUsesLisp2021-07-234-4/+56
|
* glasm: Implement EmitVertex and EndPrimitiveReinUsesLisp2021-07-232-4/+8
|
* glasm: Implement ImageGradientReinUsesLisp2021-07-232-7/+65
|
* glasm: Implement 64-bit shiftsReinUsesLisp2021-07-232-12/+14
|
* glasm: Implement barriersReinUsesLisp2021-07-231-3/+3
|
* glasm: Fix compute stage nameReinUsesLisp2021-07-231-1/+1
|
* glasm: Fix phi instruction typesReinUsesLisp2021-07-231-1/+1
|
* glasm: Implement PREC on relevant instructionsReinUsesLisp2021-07-231-6/+12
|
* glasm: Implement stores to gl_ViewportIndexReinUsesLisp2021-07-234-7/+29
|
* glasm: Implement gl_PointSize storesReinUsesLisp2021-07-231-0/+3
|
* glasm: Implement gl_PointCoordReinUsesLisp2021-07-231-0/+4
|
* glasm: Implement ImageQueryLodReinUsesLisp2021-07-231-3/+5
|
* glasm: Implement ImageFetchReinUsesLisp2021-07-234-13/+38
|
* glasm: Implement IADD.CCameerj2021-07-231-1/+26
|
* glasm: Implement BFE.CCReinUsesLisp2021-07-231-0/+8
|
* glasm: Implement SelectU1ReinUsesLisp2021-07-232-4/+5
|
* glasm: Implement gl_WorkGroupIDReinUsesLisp2021-07-232-3/+3
|
* glasm: Implement TXQ and improve texture info readsReinUsesLisp2021-07-232-50/+51
|
* glasm: Implement gl_FrongFacing attributeReinUsesLisp2021-07-231-0/+3
|
* glasm: Support textures used in more than one stageReinUsesLisp2021-07-233-4/+24
|
* glasm: Implement textureGather instructionsReinUsesLisp2021-07-232-15/+97
|
* glasm: Implement gl_FragDepth and gl_SampleMask storesReinUsesLisp2021-07-232-5/+5
|
* glasm: Do not alias ConditionRef for nowReinUsesLisp2021-07-232-3/+2
| | | | | Immediate condition refs where not handled correctly. Just move the value for now.
* shader: Read branch conditions from an instructionReinUsesLisp2021-07-237-5/+20
| | | | Fixes the identity removal pass.
* glasm: Implement InstanceId and VertexIdReinUsesLisp2021-07-231-0/+6
|
* glasm: Add missing return value on move assignmentReinUsesLisp2021-07-231-0/+1
|
* glasm: Fix aliased bitcasts ref countingReinUsesLisp2021-07-233-13/+42
|
* glasm: Remove unintentional comma on vector insertReinUsesLisp2021-07-231-1/+1
|
* glasm: Implement TEX and TEXS instructionsReinUsesLisp2021-07-236-47/+258
| | | | | Remove lod clamp from texture instructions with lod, as this is not needed (nor supported).
* glasm: Add support for non-2D texture samplesReinUsesLisp2021-07-231-4/+26
|
* glasm: Reorder unreachable image instructions to the bottomReinUsesLisp2021-07-231-97/+97
|
* glasm: Add support for texture offsetsReinUsesLisp2021-07-231-11/+15
|
* glasm: Improve texture sampling instructionsReinUsesLisp2021-07-232-50/+70
|
* emit_glasm: Enable ARB_draw_buffers when neededReinUsesLisp2021-07-232-1/+5
|
* emit_glasm: Add support for reading position attributesReinUsesLisp2021-07-231-3/+13
|
* shader_recompiler: GCC fixeslat9nq2021-07-233-25/+25
| | | | | Fixes members of unnamed union not being accessible, and one function without a declaration.
* glasm: Implement rest of shared memameerj2021-07-232-35/+29
|
* emit_spirv: Jump to loop body with local variableReinUsesLisp2021-07-231-1/+1
| | | | Silence unused variable warning
* glasm: Implement derivative instructions on GLASMReinUsesLisp2021-07-232-12/+12
|
* glasm: Initial (broken) implementation of TEX on GLASMReinUsesLisp2021-07-233-299/+386
|
* glasm: Implement some graphics instructions on GLASMReinUsesLisp2021-07-232-6/+5
|
* glasm: Add Void type to GLASM valuesReinUsesLisp2021-07-233-0/+15
|
* glasm: Add graphics specific shader declarations to GLASMReinUsesLisp2021-07-232-6/+63
|
* glasm: Implement local memory for glasmameerj2021-07-234-9/+12
|
* emit_spirv: Add missing block in caseReinUsesLisp2021-07-231-1/+2
|
* glasm: Initial implementation of phi nodes on GLASMReinUsesLisp2021-07-235-10/+88
|
* glasm: Write result to scalar on integer comparison instructionsReinUsesLisp2021-07-231-10/+10
|
* glasm: Declare NV_shader_thread_group when neededReinUsesLisp2021-07-231-3/+4
|
* glasm: Rework control flow introducing a syntax listReinUsesLisp2021-07-239-88/+116
| | | | | This commit regresses VertexA shaders, their transformation pass has to be adapted to the new control flow.
* glasm: Implement Storage atomicsameerj2021-07-235-109/+156
| | | | | StorageAtomicExchangeU64 is failing test seemingly due to failure storing 64-bit result into the register
* glasm: Ensure reg alloc order across compilers on GLASMReinUsesLisp2021-07-231-11/+14
| | | | | | | | | | | | | | | | | | Use a struct constructor to serialize register allocation arguments to ensure registers are allocated in the same order regardless of the compiler used. The A and B functions can be called in any order when passed as arguments to "foo": foo(A(), B()) But the order is guaranteed for curly-braced constructor calls in classes: Foo{A(), B()} Use this to get consistent behavior.
* glasm: Enable unintentionally disabled register aliasing on GLASMReinUsesLisp2021-07-231-16/+11
|
* glasm: Review all GLASM insts to be aware of register aliasingReinUsesLisp2021-07-234-20/+51
|
* glasm: Implement shuffle and vote instructions on GLASMReinUsesLisp2021-07-238-99/+162
|
* glasm: Add MUFU instructions to GLASMReinUsesLisp2021-07-232-21/+22
|
* glasm: Implement IAbs64 and INeg64 on GLASMReinUsesLisp2021-07-232-6/+6
|
* glasm: Properly clamp Fp64 on GLASMReinUsesLisp2021-07-231-6/+6
|
* glasm: Fix register allocation when moving immediate on GLASMReinUsesLisp2021-07-233-42/+89
|
* glasm: Implement SelectU64 on GLASMReinUsesLisp2021-07-232-4/+20
|
* glasm: Fix clamps so the min value has priority on NAN on GLASMReinUsesLisp2021-07-231-12/+15
|
* glasm: Fix moving U64 immediates to registers in GLASMReinUsesLisp2021-07-232-3/+4
|
* glasm: Implement storage atomic opsameerj2021-07-234-305/+358
|
* glasm: Add conversion instructions to GLASMReinUsesLisp2021-07-239-282/+351
|
* glasm: Add fp min/max insts and fix store for fp64 on GLASMReinUsesLisp2021-07-232-10/+8
|
* glasm: Add logical instructions on GLASMReinUsesLisp2021-07-232-12/+12
|
* glasm: Remove duplicated Fp64 pack instructions on GLASMReinUsesLisp2021-07-231-8/+0
|
* glasm: Remove unnecesary new white space on Clamp GLASMReinUsesLisp2021-07-231-4/+4
|
* glasm: Add floating-point comparisons on GLASMReinUsesLisp2021-07-233-120/+116
|
* emit_glasm: Implement more integer alu opsameerj2021-07-232-47/+41
|
* glasm: Reimplement bitwise ops and BFI/BFEameerj2021-07-234-88/+108
|
* glasm: Initial GLASM fp64 supportReinUsesLisp2021-07-239-55/+152
|
* glasm: Implement GLASM fp16 packing and move bitwise insnsReinUsesLisp2021-07-234-66/+77
|
* glasm: Remove unused functions left from rebaseReinUsesLisp2021-07-231-12/+0
|
* glasm: Specify namespace when using FormatToReinUsesLisp2021-07-231-6/+6
|
* glasm: Implement more GLASM composite instructionsReinUsesLisp2021-07-232-54/+63
|
* glasm: Make GLASM aware of typesReinUsesLisp2021-07-2312-1244/+1380
|
* glasm: Use CMP.S for Select32ameerj2021-07-233-12/+8
| | | | also fixes ADD and SUB to use U modifier
* glasm: Implement more logical opsameerj2021-07-232-5/+5
|
* glasm: Implement BFI, BFEameerj2021-07-234-138/+164
| | | | Along with implementations of common instructions along the way
* glasm: Use BitField instead of C bitfieldsReinUsesLisp2021-07-232-8/+12
|
* glasm: Remove unused argument in identity instructions on GLASMReinUsesLisp2021-07-231-7/+7
|
* glasm: Implement basic GLASM instructionsReinUsesLisp2021-07-239-840/+1167
|
* glasm: Changes to GLASM register allocator and emit contextReinUsesLisp2021-07-234-26/+64
|
* glasm: Add GLASM backend infrastructureReinUsesLisp2021-07-2326-0/+3079
|
* shader: Fixup SPIR-V emit header namespacesReinUsesLisp2021-07-231-2/+2
|
* Move SPIR-V emission functions to their own headerReinUsesLisp2021-07-2323-572/+629
|
* shader: Optimize NVN FallthroughFernandoS272021-07-231-0/+3
|
* shader: Implement Int32 SUATOM/SUREDameerj2021-07-235-0/+233
|
* spirv: Be aware of NAN unaware driversReinUsesLisp2021-07-231-18/+40
|
* spirv: Add SSBO read fallbacks when no aliasing is availableReinUsesLisp2021-07-231-37/+99
|
* spirv: Add OpKill fallback to demoteReinUsesLisp2021-07-231-2/+6
|
* spirv: Do not enable ShaderLayerReinUsesLisp2021-07-231-3/+0
| | | | This is enabled by an extension instead of the capability.
* spirv: Enable DemoteToHelperInvocationEXT only when supportedReinUsesLisp2021-07-231-1/+1
|
* spirv: Use OriginLowerLeft when requestedReinUsesLisp2021-07-231-1/+5
|
* spirv: Only add image operands mask when neededReinUsesLisp2021-07-231-5/+9
|
* spirv: Workaround image unsigned offset bugReinUsesLisp2021-07-232-9/+26
| | | | | Workaround bug on Nvidia's OpenGL SPIR-V compiler when using unsigned texture offsets.
* spirv: Add int8 and int16 capabilities only when supportedReinUsesLisp2021-07-231-2/+2
|
* spirv: Add integer clamping workaroundsReinUsesLisp2021-07-231-4/+34
| | | | Workaround more bugs on Nvidia's OpenGL SPIR-V compiler.
* spirv: Implement int8 and int16 conversion fallbacksReinUsesLisp2021-07-231-19/+80
|
* spirv: Support OpenGL uniform buffers and change bindingsReinUsesLisp2021-07-235-56/+163
|
* spirv: Desambiguate descriptor namesReinUsesLisp2021-07-231-9/+37
| | | | | Worksaround a bug on Nvidia's OpenGL SPIR-V compiler where names are used for name matching.
* shader: Implement VertexA stageFernandoS272021-07-232-0/+5
|
* shader: Fix storage type when reading patches on tess controlReinUsesLisp2021-07-231-1/+2
|
* shader: Implement indexed texturesReinUsesLisp2021-07-233-56/+92
|
* shader: Move microinstruction header to the value headerReinUsesLisp2021-07-232-2/+0
|
* shader: Add NVN storage buffer fallbacksReinUsesLisp2021-07-235-24/+109
| | | | | | | When we can't track the SSBO origin of a global memory instruction, leave it as a global memory operation and assume these pointers are in the NVN storage buffer slots, then apply a linear search in the shader's runtime.
* spirv: Fix ViewportMaskReinUsesLisp2021-07-231-1/+2
|
* spirv: Replace Constant/ConstantComposite with Const helperameerj2021-07-2312-112/+101
|
* shader: Address feedbackFernandoS272021-07-232-5/+2
|
* shader: Add coarse derivativesFernandoS272021-07-232-0/+12
|
* shader: Implement fine derivates constant propagationFernandoS272021-07-233-0/+15
|
* shader: Implement SR_Y_DIRECTIONFernandoS272021-07-232-0/+8
|
* shader: Fix memory barriersReinUsesLisp2021-07-232-12/+7
|
* spirv: Fix implicit lod typeReinUsesLisp2021-07-232-1/+5
|
* spirv: Use explicit lods outside of fragment shadersReinUsesLisp2021-07-231-5/+16
|
* spirv: Use ConstOffset instead of Offset when possibleReinUsesLisp2021-07-233-21/+67
|
* shader: Implement BFE and BFI CCameerj2021-07-231-2/+1
| | | | Fix two bugs in BFI.
* shader: Implement SampleMaskReinUsesLisp2021-07-234-1/+10
|
* shader: Implement PIXLD.MY_INDEXReinUsesLisp2021-07-235-0/+12
|
* spirv: Bitcast non-F32 output attributes to their type before storeReinUsesLisp2021-07-231-13/+28
|
* spirv: Implement ViewportMask with NV_viewport_array2ReinUsesLisp2021-07-234-0/+15
|
* spirv: Bitcast non-F32 attributes to F32ReinUsesLisp2021-07-231-7/+9
|
* shader: Implement PrimitiveIdReinUsesLisp2021-07-233-0/+6
|
* shader: Implement tessellation shaders, polygon mode and invocation idReinUsesLisp2021-07-235-55/+232
|
* spirv: Implement image buffersReinUsesLisp2021-07-233-22/+58
|
* spirv: Implement Layer storesReinUsesLisp2021-07-234-9/+26
|
* spirv: Fix alpha testFernandoS272021-07-231-0/+5
|
* spirv: Fix non-atomic 64-bit storeameerj2021-07-231-1/+1
|
* spirv: Implement alpha testameerj2021-07-231-0/+45
|
* shader: Implement transform feedbacks and define file formatReinUsesLisp2021-07-235-16/+97
|
* shader: Implement early Z testsReinUsesLisp2021-07-231-0/+3
|
* spirv: Rework storage buffers and shader memoryReinUsesLisp2021-07-236-492/+487
|
* shader: Fix fixed pipeline point size on geometry shadersReinUsesLisp2021-07-231-10/+18
|
* shader: Implement geometry shadersReinUsesLisp2021-07-235-65/+163
|
* shader: Implement OUTReinUsesLisp2021-07-232-0/+10
|
* spirv: Define StorageImageWriteWithoutFormat capability when usedReinUsesLisp2021-07-231-0/+3
|
* shader: Implement LOP CCReinUsesLisp2021-07-232-9/+18
|
* shader: Implement SR_THREAD_KILLReinUsesLisp2021-07-234-0/+9
|
* shader: Implement ATOM/S and REDameerj2021-07-235-4/+800
|
* spirv: Move phi node patching to a separate functionReinUsesLisp2021-07-231-13/+16
|
* spirv: Guard against typeless image reads on unsupported devicesReinUsesLisp2021-07-232-1/+7
|
* shader: Move LaneId to the warp emission file and fix AMDReinUsesLisp2021-07-233-5/+9
|
* spirv: Fix forward declarations on phi nodesReinUsesLisp2021-07-231-47/+25
|
* shader: Implement SULD and SUSTReinUsesLisp2021-07-235-29/+130
|
* shader: Address feedback + clang formatlat9nq2021-07-232-5/+0
|
* shader_recompiler,video_core: Cleanup some GCC and Clang errorslat9nq2021-07-234-14/+22
| | | | | | | | | | | | | | | | | Mostly fixing unused *, implicit conversion, braced scalar init, fpermissive, and some others. Some Clang errors likely remain in video_core, and std::ranges is still a pertinent issue in shader_recompiler shader_recompiler: cmake: Force bracket depth to 1024 on Clang Increases the maximum fold expression depth thread_worker: Include condition_variable Don't use list initializers in control flow Co-authored-by: ReinUsesLisp <reinuseslisp@airmail.cc>
* shader: Interact texture buffers with buffer cacheReinUsesLisp2021-07-232-28/+28
|
* shader: Fix TextureGradReinUsesLisp2021-07-231-1/+1
|
* shader: Implement texture buffersReinUsesLisp2021-07-234-5/+54
|
* shader: Address feedbackFernandoS272021-07-231-20/+20
|
* shader: Implement indexed Position and ClipDistancesFernandoS272021-07-231-0/+40
|
* shader: Implement indexed attributesFernandoS272021-07-234-6/+134
|
* shader: Fix ShadowCube declaration type, set number of pipeline threads based on hardwareFernandoS272021-07-231-1/+1
|
* shader: Add subgroup masksReinUsesLisp2021-07-234-10/+56
|
* shader: Implement BAR and fix memory barriersReinUsesLisp2021-07-232-3/+13
|
* shader: Reimplement GetCbufU64 as GetCbufU32x2ReinUsesLisp2021-07-234-6/+6
| | | | It may generate better code on some compilers and it's easier to handle.
* shader: Address FeedbackFernandoS272021-07-236-71/+26
|
* shader: Implement SR_LaneIdFernandoS272021-07-232-0/+5
|
* shader: Fix shared memory on cool driversFernandoS272021-07-231-0/+1
|
* shader: Implement MEMBARFernandoS272021-07-232-0/+41
|
* shader: Improve VOTE.VTG stubFernandoS272021-07-232-0/+40
|
* shader: Implement ViewportIndexFernandoS272021-07-234-2/+27
|
* shader: Stub TLD4's PTP when it isn't constantFernandoS272021-07-231-1/+2
|
* shader: Fix branches to visited virtual blocksReinUsesLisp2021-07-231-0/+2
|
* shader: Fix dependency on identity removal passReinUsesLisp2021-07-232-3/+8
|
* spirv: Remove unnecesary variable for clip distancesReinUsesLisp2021-07-232-6/+2
|
* shader: Implement ClipDistanceFernandoS272021-07-233-0/+25
|
* shader: Fix TXDFernandoS272021-07-231-1/+1
|
* shader: Address feedbackFernandoS272021-07-231-15/+15
|
* shader: Implement ImageGradientFernandoS272021-07-232-1/+54
|
* shader: Implement TMML partiallyFernandoS272021-07-231-2/+2
|
* shader,spirv: Implement ImageQueryLod.FernandoS272021-07-234-0/+20
|
* shader: Implement TLDFernandoS272021-07-231-1/+1
|
* spirv: Add fixed pipeline point sizeReinUsesLisp2021-07-232-1/+5
|
* shader: Add PointCoord attributeFernandoS272021-07-233-0/+11
|
* shader: Add PointSize attributeameerj2021-07-233-0/+9
|
* shader: Store type of phi nodes in flagsReinUsesLisp2021-07-231-1/+2
| | | | This is needed because pseudo-instructions where invalidated.
* spirv: Fix default output attribute initializationReinUsesLisp2021-07-231-3/+3
|
* shader: Implement FSWZADDameerj2021-07-234-0/+27
|
* shader: Implement BRXFernandoS272021-07-233-2/+15
|
* shader: Implement I2I CCameerj2021-07-232-21/+41
|
* shader: Implement I2I SATameerj2021-07-232-0/+10
|
* shader: Implement LDS, STS, LDL, and STS and use SPIR-V 1.4 when availableReinUsesLisp2021-07-235-1/+334
|
* shader: Better interpolation and disabled attributes supportReinUsesLisp2021-07-232-12/+46
|
* spirv: Remove dependencies on Environment when generating SPIR-VReinUsesLisp2021-07-232-9/+7
|
* shader: Implement front faceReinUsesLisp2021-07-233-0/+8
|
* shader: Implement TXQ and fix FragDepthReinUsesLisp2021-07-235-8/+64
|
* shader: Refactor PTP and other minor changesReinUsesLisp2021-07-235-56/+46
|
* shader: Add IR opcode for ImageFetchFernandoS272021-07-233-3/+35
|
* shader: Implement TLD4.PTPFernandoS272021-07-235-9/+40
|
* shader: Implement FragDepthFernandoS272021-07-231-0/+6
|
* shader: Implement TLD4 and TLD4_BFernandoS272021-07-232-0/+48
|
* shader: Implement SHFLameerj2021-07-235-60/+151
|
* shader: Fix F2IFernandoS272021-07-232-4/+19
|
* shader: Implement NDC [-1, 1], attribute types and default varying initializationReinUsesLisp2021-07-235-39/+112
|
* shader: Implement VOTEameerj2021-07-235-1/+77
|
* shader: Implement DMNMX, DSET, DSETPameerj2021-07-233-18/+26
|
* shader: Add support for fp16 comparisons and misc fixesReinUsesLisp2021-07-232-0/+10
|
* spirv: Implement VertexId and InstanceId, refactor codeReinUsesLisp2021-07-236-144/+234
|
* shader: Implement I2FReinUsesLisp2021-07-234-0/+67
|
* shader: Add partial rasterizer integrationReinUsesLisp2021-07-236-50/+172
|
* shader: Implement DADDameerj2021-07-231-0/+2
|
* shader: Implement FSET and FSETPameerj2021-07-231-2/+4
| | | | Also fix oversight with adding SignedZeroInfNanPreserve execution mode.
* shader: Implement TEXSReinUsesLisp2021-07-231-1/+10
|
* shader: Implement CAL inlining function callsReinUsesLisp2021-07-232-15/+8
|
* spirv: Add SignedZeroInfNanPreserve logicameerj2021-07-231-0/+6
|
* shader: Implement FMNMXameerj2021-07-232-12/+12
| | | | And add a const in FCMP
* shader: Implement FCMPameerj2021-07-232-0/+5
| | | | still need to configure some settings for NV denorm flush and intel NaN
* shader: Partial implementation of LDCReinUsesLisp2021-07-234-25/+124
|
* shader: Initial support for textures and TEXReinUsesLisp2021-07-237-5/+327
|
* shader: Implement R2Pameerj2021-07-233-3/+13
|
* shader: Implement SHFameerj2021-07-232-9/+19
|
* shader: Implement LEAameerj2021-07-233-3/+13
|
* shader: Implement HADD2ReinUsesLisp2021-07-233-18/+98
|
* shader: Implement FLOameerj2021-07-232-3/+13
|
* shader: Implement IMNMXameerj2021-07-232-0/+20
|
* shader: Implement POPCameerj2021-07-232-0/+10
|
* shader: Implement SHRameerj2021-07-232-6/+11
|
* spirv: Move phi arguments emit to a separate functionReinUsesLisp2021-07-231-27/+27
|
* spirv: Add support for self-referencing phi nodesReinUsesLisp2021-07-231-3/+10
|
* shader: Implement more of XMAD and FFMA32I and fix XMAD.CBCCReinUsesLisp2021-07-232-6/+6
|
* shader: FMUL, select, RRO, and MUFU fixesReinUsesLisp2021-07-233-52/+228
|
* shader: Fix MOV(reg), add SHL variants and emit neg and abs instructionsReinUsesLisp2021-07-232-6/+6
|
* spirv: Fixes and Intel specific workaroundsReinUsesLisp2021-07-236-20/+31
|
* shader: Rename, implement FADD.SAT and P2R (imm)ReinUsesLisp2021-07-234-79/+98
|
* shader: Add denorm flush supportReinUsesLisp2021-07-233-9/+64
|
* spirv: Add lower fp16 to fp32 passReinUsesLisp2021-07-2312-210/+255
|
* shader: Primitive Vulkan integrationReinUsesLisp2021-07-2313-467/+461
|
* spirv: Implement EmitIdentityReinUsesLisp2021-07-232-3/+3
|
* spirv: Initial bindings supportReinUsesLisp2021-07-2312-218/+408
|
* shader: Add support for forward declarationsReinUsesLisp2021-07-232-52/+49
|
* shader: Support SSA loops on IRReinUsesLisp2021-07-233-15/+15
|
* shader: Misc fixesReinUsesLisp2021-07-232-0/+11
|
* shader: Initial implementation of an ASTReinUsesLisp2021-07-235-41/+67
|
* spirv: Initial SPIR-V supportReinUsesLisp2021-07-2312-3/+1359
|
* shader: Add pools and rename filesReinUsesLisp2021-07-231-0/+21