fufexan has quit [Remote host closed the connection]
fufexan has joined #dri-devel
<karolherbst>
zmike: ever ran into the issue where you get "radv/amdgpu: The CS has been rejected, see dmesg for more information (-14)." but dmesg contains nothing? I'm hitting "../src/gallium/drivers/zink/zink_context.c:4265: zink_wait_on_batch: Assertion `batch_id' failed." but no idea why. validation layers are also clean
<tnt>
I got an application triggering "GL_OUT_OF_MEMORY in glTextureStorage2DMultisample(texture too large)". The texture in question is 3582x3582 GL_RGBA32F 8 samples so that should be about a ~1.6G texture. I've got like 15G free RAM and it's an iris iGPU so are there other limits that apply ?
<zmike>
karolherbst: that's a gpu hang
<karolherbst>
yeah sure, but normally amdgpu prints stuff in dmesg
<karolherbst>
which it doesn't
<zmike>
idk that's usually what I see
<karolherbst>
I wonder if it's a kernel bug tho...
<karolherbst>
or something else going on
<karolherbst>
-14 is a weird error code
<karolherbst>
maybe just a post driver forgets to print something, but then again... anyway, I was just curious if you know more
<zmike>
I do not
<zmike>
usually those types of hangs are something being fucked with descriptors
<karolherbst>
thing is it only happens randomly, so not quite sure what sometimes goes wrong
<zmike>
or maybe that's -21
<karolherbst>
I think the -14 comes from validation, so maybe I just add some printks and see if I can figure out why the kernel is unhappy
<zmike>
you could try with vvl and ZINK_DESCRIPTORS=lazy to see if it finds something then
Surkow|laptop has quit [Quit: 418 I'm a teapot - NOP NOP NOP]
sguddati has quit [Ping timeout: 480 seconds]
sguddati has joined #dri-devel
amarsh04 has quit []
Surkow|laptop has joined #dri-devel
Thymo has joined #dri-devel
nerdopolis has joined #dri-devel
<karolherbst>
zmike: okay.. I think I found it, and it's as silly as anticipated. So it's related to userptrs and I think kernels reject the submissions if there are stale buffers referenced whose userptr has been freed by the application already
<karolherbst>
if I make the CTS stop freeing them, I don't submits failing anymore, but the test is failing, so I wonder if userptr support is busted in one or another way
Jeremy_Rand_Talos has quit [Remote host closed the connection]
Jeremy_Rand_Talos has joined #dri-devel
JLP has joined #dri-devel
kzd has joined #dri-devel
coldfeet has joined #dri-devel
kzd has quit [Quit: kzd]
rasterman has joined #dri-devel
haaninjo has joined #dri-devel
sguddati has quit [Ping timeout: 480 seconds]
helmhotz has quit [Ping timeout: 480 seconds]
helmhotz has joined #dri-devel
Aura has quit []
coldfeet has quit [Quit: Lost terminal]
kzd has joined #dri-devel
kts has quit [Quit: Konversation terminated!]
calico has joined #dri-devel
calico_ has quit [Remote host closed the connection]
calico has quit []
calico has joined #dri-devel
coldfeet has joined #dri-devel
chewitt has quit [Quit: Zzz..]
calico has quit [Remote host closed the connection]
calico has joined #dri-devel
JRepin has quit []
JRepin has joined #dri-devel
<karolherbst>
mhhh.. soo it looks like it works in principle, just that parts of the buffer don't contain the expected values... doesn't look like a sync issue, maybe something with caches?
Karyon_ has joined #dri-devel
Karyon has quit [Ping timeout: 480 seconds]
kzd has quit [Quit: kzd]
warpme has joined #dri-devel
warpme has quit []
nerdopolis has quit [Quit: No Ping reply in 180 seconds.]
nerdopolis has joined #dri-devel
bbrezill1 has quit []
bbrezillon has joined #dri-devel
alanc has quit [Remote host closed the connection]
alanc has joined #dri-devel
kzd has joined #dri-devel
kzd has quit [Quit: kzd]
SquareWinter68 has joined #dri-devel
SquareWinter68_ has quit [Ping timeout: 480 seconds]