Show HN: GPULlama3.java Llama Compiled to PTX/OpenCL Now Integrated in Quarkus
lostmsu 3 days ago [-]
Does it support flash attention? Use tensor cores? Can I write custom kernels?

UPD: I found no evidence that it supports tensor cores, so it's going to be many times slower than implementations that do.

mikepapadim 3 days ago [-]
Yes, when you use the PTX backend it supports Tensor Cores. It also has an implementation of flash attention. You can also write your own kernels; have a look here: https://github.com/beehive-lab/GPULlama3.java/blob/main/src/... https://github.com/beehive-lab/GPULlama3.java/blob/main/src/...
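
For anyone who hasn't used TornadoVM: a custom kernel is an ordinary Java method whose @Parallel loops the runtime JIT-compiles to PTX or OpenCL and wires into a task graph. A minimal sketch against the standard TornadoVM API (SaxpyDemo and all names below are illustrative, not taken from the repo):

    import uk.ac.manchester.tornado.api.TaskGraph;
    import uk.ac.manchester.tornado.api.TornadoExecutionPlan;
    import uk.ac.manchester.tornado.api.annotations.Parallel;
    import uk.ac.manchester.tornado.api.enums.DataTransferMode;
    import uk.ac.manchester.tornado.api.types.arrays.FloatArray;

    public class SaxpyDemo {
        // The kernel is plain Java; TornadoVM JIT-compiles the
        // @Parallel loop to PTX (NVIDIA) or OpenCL C at run time.
        public static void saxpy(float alpha, FloatArray x, FloatArray y) {
            for (@Parallel int i = 0; i < x.getSize(); i++) {
                y.set(i, alpha * x.get(i) + y.get(i));
            }
        }

        public static void main(String[] args) {
            FloatArray x = new FloatArray(1024);
            FloatArray y = new FloatArray(1024);
            x.init(1.0f);
            y.init(2.0f);

            // Declare data movement and the task, then execute on the GPU.
            TaskGraph graph = new TaskGraph("s0")
                    .transferToDevice(DataTransferMode.FIRST_EXECUTION, x, y)
                    .task("t0", SaxpyDemo::saxpy, 2.0f, x, y)
                    .transferToHost(DataTransferMode.EVERY_EXECUTION, y);

            new TornadoExecutionPlan(graph.snapshot()).execute();
        }
    }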
lostmsu 3 days ago [-]
The TornadoVM GitHub has no mention of tensor cores or WMMA instructions in the code. The only mention of tensor cores is a 2024 discussion, which states they are not used: https://github.com/beehive-lab/TornadoVM/discussions/393
mikepapadim 3 days ago [-]
lostmsu 1 day ago [-]
I believe these are SIMD. Tensor cores require the MMA family of instructions (sketch after the links below). Ask me how I know. :)

https://github.com/m4rs-mt/ILGPU/compare/master...lostmsu:IL...

Good article: https://alexarmbr.github.io/2024/08/10/How-To-Write-A-Fast-M...
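
To make the SIMD-vs-tensor-core point concrete: a plain @Parallel loop like the hypothetical matmul below lowers to scalar/vector PTX such as fma.rn.f32. Tensor cores only engage when the compiler emits the MMA/WMMA instruction family over warp-owned fragments, which is a separate code-generation path. A sketch, with generic names not taken from either repo:

    import uk.ac.manchester.tornado.api.annotations.Parallel;
    import uk.ac.manchester.tornado.api.types.arrays.FloatArray;

    public class MatMulSketch {
        // TornadoVM's PTX backend lowers this loop nest to scalar/SIMD
        // instructions, e.g.:
        //     fma.rn.f32  %f4, %f1, %f2, %f3;
        // Engaging tensor cores would instead require emitting the MMA
        // family over warp-owned fragments, e.g.:
        //     mma.sync.aligned.m16n8k16.row.col.f32.f16.f16.f32 ...
        // which no amount of loop parallelism produces by itself.
        public static void matmul(FloatArray a, FloatArray b, FloatArray c, int n) {
            for (@Parallel int i = 0; i < n; i++) {
                for (@Parallel int j = 0; j < n; j++) {
                    float sum = 0.0f;
                    for (int k = 0; k < n; k++) {
                        sum += a.get(i * n + k) * b.get(k * n + j);
                    }
                    c.set(i * n + j, sum);
                }
            }
        }
    }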
