tomba has quit [Quit: ZNC 1.9.0+deb2build3 - https://znc.in]
tomba has joined #dri-devel
hikiko has joined #dri-devel
hikiko_ has quit [Ping timeout: 480 seconds]
pcercuei has joined #dri-devel
coldfeet has quit [Quit: Lost terminal]
vliaskov_ has quit [Ping timeout: 480 seconds]
kts has quit [Ping timeout: 480 seconds]
gouchi has joined #dri-devel
warpme has joined #dri-devel
kts has joined #dri-devel
ammen99 has quit [Remote host closed the connection]
bolson has joined #dri-devel
tobiasjakobi has joined #dri-devel
tobiasjakobi has quit []
JRepinc has quit [Ping timeout: 480 seconds]
kts has quit [Ping timeout: 480 seconds]
FAQ_ has joined #dri-devel
JRepinc has joined #dri-devel
warpme has quit []
kts has joined #dri-devel
Jeremy_Rand_Talos has joined #dri-devel
nerdopolis has joined #dri-devel
Alisa[m] has joined #dri-devel
coldfeet has joined #dri-devel
JRepin has joined #dri-devel
JRepinc has quit [Read error: Connection reset by peer]
nerdopolis has quit [Ping timeout: 480 seconds]
Daanct12 has quit [Quit: WeeChat 4.6.3]
hikiko_ has joined #dri-devel
kts has quit [Ping timeout: 480 seconds]
hikiko has quit [Ping timeout: 480 seconds]
asrivats__ has joined #dri-devel
kts has joined #dri-devel
asrivats__ has quit [Ping timeout: 480 seconds]
nerdopolis has joined #dri-devel
gouchi has quit [Remote host closed the connection]
kts has quit [Ping timeout: 480 seconds]
nerdopolis has quit [Ping timeout: 480 seconds]
pcercuei has quit [Remote host closed the connection]
Alisa[m] has quit [autokilled: This host violated network policy. Contact support@oftc.net for further information and assistance. (2025-05-31 14:30:44)]
<karolherbst>
robclark: sure, but the entire profiling code needs to be reworked anyway
<robclark>
ok, as long as $new_thing dtrt
<karolherbst>
the thing is that start/end events are about when the GPU starts/ends processing commands
<karolherbst>
and I don't really want to read those values out on the CPU
<karolherbst>
should use `get_query_result_resource` instead of `get_query_result`
<karolherbst>
and have a slab-allocated heap that I can just throw the results into
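(For context, a minimal sketch of the Gallium call being proposed here, assuming the `get_query_result_resource` hook as in Mesa's p_context.h; `q`, `result_buf`, and `offset` are hypothetical names:)
```c
/* Ask the GPU to write the raw 64-bit result of `q` into result_buf at
 * a byte offset, with no CPU-side wait.  Per p_context.h: flags 0 means
 * don't wait, index 0 selects the query value itself (index -1 would
 * store availability instead). */
pipe->get_query_result_resource(pipe, q,
                                0,                    /* no PIPE_QUERY_WAIT */
                                PIPE_QUERY_TYPE_U64,  /* raw u64 result */
                                0,                    /* value, not availability */
                                result_buf, offset);
```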
<robclark>
well, app is going to _eventually_ read them out on CPU.. I guess you could use the get_query_result_resource() path, but that is a pita for me since I can't properly convert ticks to ms on the gpu without spinning up a compute shader ;-)
<robclark>
so for timestamp queries I have to kinda fake it for get_query_result_resource()
<karolherbst>
mhhh
<robclark>
if you just kept a list/vector/whatever of the queries and then did readback after flush it wouldn't be so bad
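(A sketch of that alternative, assuming standard Gallium `get_query_result()` semantics; the `deferred_query` struct and the readback helper are hypothetical:)
```c
/* Accumulate query objects as commands are recorded... */
struct deferred_query {
   struct pipe_query *start, *end;
};

/* ...then, after flushing, read them back in one pass; wait=true only
 * blocks until each result is actually available. */
static void read_back_deferred(struct pipe_context *pipe,
                               struct deferred_query *dq, unsigned n)
{
   for (unsigned i = 0; i < n; i++) {
      union pipe_query_result start, end;
      pipe->get_query_result(pipe, dq[i].start, true, &start);
      pipe->get_query_result(pipe, dq[i].end, true, &end);
      /* start.u64 / end.u64 now hold the timestamps */
   }
}
```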
<karolherbst>
the thing is.. I just submit commands, and I need to know when the GPU starts/ends processing a command, and a query object thrown into the command stream is generally how it should be (tm). Though I can see that on tilers things are different (tm)
<robclark>
for compute, fortunately tiling isn't a thing.. it's really just about not being able to convert to ms on the CP
<karolherbst>
it doesn't have to be ms or something afaik
nerdopolis has joined #dri-devel
<robclark>
or us or something.. I forget what units the result is defined in, but it is a unit of time, not ticks
<karolherbst>
oh actually.. the spec wants it in ns
<robclark>
ahh, right
dsimic is now known as Guest17075
dsimic has joined #dri-devel
<robclark>
hmm, although I wonder if I could get something added to fw.. we really just need to multiply by (approx) 52
Guest17075 has quit [Ping timeout: 480 seconds]
<robclark>
I guess 52 is close enough to 52.083333333
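(That 52.083333333 comes out exactly if the counter ticks at 19.2 MHz, which is an assumption here: 10^9 / (19.2 * 10^6) = 625/12. A sketch of the exact integer conversion:)
```c
#include <stdint.h>

/* ticks -> ns for an assumed 19.2 MHz always-on counter:
 * 1e9 / 19.2e6 = 625/12 = 52.08333..., so multiplying before dividing
 * keeps the division exact (overflows only past ~3e16 ticks, i.e.
 * decades of uptime). */
static inline uint64_t ticks_to_ns(uint64_t ticks)
{
   return ticks * 625 / 12;
}
```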
<robclark>
still, it would be easier to do on cpu
<karolherbst>
robclark: anyway.. the idea was to insert a timestamp query object, do a bunch of gallium commands, do the second timestamp query and just read out the results once the GPU is done
<robclark>
yeah, and as you pointed out one way to do that is get_query_result_resource().. the other is to track the query objects
<karolherbst>
how are queries implemented for you anyway? Is it like a command stream thing where the GPU writes a timestamp into a location at some point in time, or is it more like.. done on the cpu side?
<robclark>
it's on the GPU
<karolherbst>
okay
<karolherbst>
yeah, when it's writing the values asynchronously to a buffer I can map later, that's perfect. We _could_ do CPU-side fix-ups of the values
<karolherbst>
what's important is that I don't want to stall the pipeline with busy waits
<karolherbst>
as I'm doing atm
<robclark>
right
<robclark>
well, with get_query_result_resource() you eventually stall on the result resource
<karolherbst>
but I could apply a factor before reporting back the values
<karolherbst>
well sure
<robclark>
so it amounts to the same thing.. but I could see get_query_result_resource() being easier to implement
<karolherbst>
I can map without stalling
<robclark>
yeah, I guess we could add a pipe cap to adjust the result on the cpu
<karolherbst>
PIPE_MAP_UNSYNCHRONIZED or something
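(A sketch of what that map would look like with the u_inlines.h helper; this is only safe if a fence wait has already guaranteed the GPU writes landed, and the slot layout is hypothetical:)
```c
/* PIPE_MAP_UNSYNCHRONIZED skips the implicit wait-for-GPU, so this
 * never stalls; the caller takes responsibility for ordering. */
struct pipe_transfer *xfer;
const uint64_t *vals =
   pipe_buffer_map(pipe, result_buf,
                   PIPE_MAP_READ | PIPE_MAP_UNSYNCHRONIZED, &xfer);
if (vals) {
   uint64_t t0 = vals[0];   /* hypothetical slot 0 = first timestamp */
   pipe_buffer_unmap(pipe, xfer);
}
```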
<robclark>
sure, but I guess you want to actually _have_ the result on the CPU at some point ;-)
<karolherbst>
yeah... needs to flush the thing at some point
<karolherbst>
but.
<karolherbst>
you can also copy to a second resource
<karolherbst>
and map that one without stalling :D
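(That trick as a sketch, assuming `resource_copy_region` and a host-visible `staging` buffer; the copy rides along in the command stream, so only `staging` ever gets mapped:)
```c
/* GPU-side copy of the result range into a staging buffer; the main
 * result_buf stays with the real workload and is never mapped. */
struct pipe_box box;
u_box_1d(0, size_in_bytes, &box);            /* util/u_box.h helper */
pipe->resource_copy_region(pipe, staging, 0, /* dst, level */
                           0, 0, 0,          /* dstx/y/z */
                           result_buf, 0,    /* src, level */
                           &box);
```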
<robclark>
only for readback of result on the GPU
<karolherbst>
there are a few tricks for how the actual important main work can be left alone doing its stuff
<robclark>
(idk if cl can do that)
<karolherbst>
oh the CL API doesn't care about how it's implemented really
<robclark>
if you are reading back on the CPU, then you have to wait
<karolherbst>
it just gives you raw values
<robclark>
right, but on the _CPU_
<karolherbst>
sure
<robclark>
so unless you invent time travel, it needs to wait on GPU somewhere ;-)
<robclark>
(but moar fps via time travel would be a neat trick)
<karolherbst>
not really if you e.g. tell the GPU to write the results into a coherent/staging buffer
<robclark>
sure, but cpu needs to read it after gpu writes it
<karolherbst>
right, you can't prevent that one :)
<karolherbst>
but atm, we do the read after each event is processed, then stall the GPU, then execute the next event
<karolherbst>
that just stalls the CL queue all the time
<karolherbst>
an "event" here is like a cl queue command
<robclark>
right, either deferring the query object read to getProfilingInfo() or stalling and reading the result rsc in getProfilingInfo() would amount to the same thing
<robclark>
it would stall until result is avail if it isn't already
<karolherbst>
nah, it's done on different threads
<karolherbst>
there is a queue thread working through the commands
<robclark>
gpu doesn't care so much about cpu threads
<karolherbst>
and that's stalling the GPU side of things with the current implementation constantly
<robclark>
sure, ofc
<karolherbst>
with get_query_result_resource the GPU would just write the result into some buffer while working through the commands
<karolherbst>
and then at some point it gets read out once the queue is flushed/finished or so
<karolherbst>
but that happens on a different thread then and wouldn't bother the queue one
<robclark>
but if that thread just pushed the query objects to some data structure, and deferred get_query_result() until getProfilingInfo() is called.. then you don't stall any more than you would with the get_query_result_resource() approach
<karolherbst>
getProfilingInfo doesn't call into get_query_result
<karolherbst>
the get_query_result happens way earlier
<robclark>
right, that is the problem
<robclark>
oh, but I guess you might need extra locking with my approach to avoid calling into ctx on multiple threads
<karolherbst>
yeah, and instead of get_query_result, I want to use get_query_result_resource so it's not constantly waiting on the GPU. And getProfilingInfo simply reads from the buffer instead of the temporary values the results of get_query_result were written to
<karolherbst>
there is already a bit of indirection going on there, because things are already cursed enough
<karolherbst>
well.. that's why I want to map unsynchronized or something, so I can just map on a different context
<karolherbst>
need to figure out the details at some point
<karolherbst>
maybe I just do a resource_from_user_memory thing...
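(Rough shape of that, assuming the pipe_screen `resource_from_user_memory` hook; the bind flag and alignment are guesses:)
```c
#include <stdlib.h>

struct pipe_resource templ = {
   .target     = PIPE_BUFFER,
   .format     = PIPE_FORMAT_R8_UNORM,
   .width0     = 4096,
   .height0    = 1,
   .depth0     = 1,
   .array_size = 1,
   .bind       = PIPE_BIND_QUERY_BUFFER, /* assumption: QBO-style use */
};

/* Page-aligned host memory the CPU side already owns and can read at
 * any time... */
void *host_mem = aligned_alloc(4096, 4096);

/* ...wrapped as a pipe_resource the GPU can write query results into. */
struct pipe_resource *res =
   screen->resource_from_user_memory(screen, &templ, host_mem);
```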
<robclark>
yeah, for the result rsc approach, that would work, because you can wait on fence on any thread
<karolherbst>
yeah
<karolherbst>
just need to wait until the GPU is actually done, maybe make sure the results are flushed, but then it should work in principle
<robclark>
let me look into whether CP_TICKS_TO_NS is something that I can talk someone into.. I guess it at least has a non-zero chance now..
<karolherbst>
could also just collect all the query results whenever I had to wait on a fence anyway
<robclark>
and it would be useful for qbo
<karolherbst>
the factor could default to 1
<karolherbst>
it's a bit more problematic with GL, because you can hand out a GPU resource to applications with the raw data, no?
<karolherbst>
or well.. have it written to a memory object
<robclark>
right, right now I just fail the big gl qbo timestamp/elapsed tests
<robclark>
which CP_TICKS_TO_NS would help with
<karolherbst>
yeah..
<karolherbst>
anyway.. it's harder to get all the gl bits correct here. I can just apply a factor if needed, that's not really a big issue
<robclark>
yeah, I guess we could do that as the workaround if we had to (or at least do that when fw is too old)
<karolherbst>
mhh, I think I just understood what you were trying to explain earlier 🙃.. I guess I could move the `get_query_result` calls to a later place and only do that after waiting on related fences anyway...
gouchi has joined #dri-devel
<karolherbst>
anyway, that would also require rewriting most of the profiling code, for weird reasons
<robclark>
yeah.. but if you are calling into pctx on a different thread from app thread, the threading might be a bit awkward
<robclark>
but other than that detail the two approaches are the "same"
<karolherbst>
yeah... atm PipeQuery stores a reference to the Context, and for rust reasons it would be a bit painful to delay reading it out. So writing into a resource would get around that part
<karolherbst>
because then I won't have to keep the query object around
gouchi has quit []
<karolherbst>
And using host visible memory mapped into the GPU might just make everything trivial enough to handle
<robclark>
yeah, you'd have to wait on a fence but no threading constraints there
<karolherbst>
or it's a persistent mapping and the event object just gets a pointer into a slice allocated for it
<karolherbst>
and then it's just reading from a pointer (after a flush/wait) and nothing else matters
<karolherbst>
anyway...
<karolherbst>
I have ideas, and I'd just need to figure out what I like the most
<karolherbst>
I think I like the idea of using a coherent + persistent mapping thing, because then I don't have to bother on the CPU side with copying values around and the results just appear at the right location at some point
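(Putting that preferred variant together as a sketch, assuming standard Gallium persistent/coherent mapping semantics; the slot bookkeeping is hypothetical:)
```c
/* Map once, keep the pointer for the life of the buffer. COHERENT means
 * GPU writes become visible without explicit flushes; PERSISTENT keeps
 * the mapping valid while the GPU keeps using the buffer. */
struct pipe_transfer *xfer;
uint64_t *slots = pipe_buffer_map(pipe, result_buf,
                                  PIPE_MAP_READ | PIPE_MAP_PERSISTENT |
                                  PIPE_MAP_COHERENT, &xfer);

/* Per event: point the query result at this event's slot. */
unsigned slot = 0; /* hypothetical slab-allocated slot index */
pipe->get_query_result_resource(pipe, q, 0, PIPE_QUERY_TYPE_U64, 0,
                                result_buf, slot * sizeof(uint64_t));

/* After screen->fence_finish(...) confirms the GPU is done, the value
 * has simply appeared at the right location. */
uint64_t timestamp = slots[slot];
```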
coldfeet has joined #dri-devel
chamlis has quit [Remote host closed the connection]