Papers I've read this week: vision language models
Papers I've read this week
How does batching work on modern GPUs?
Where do LLMs spend their FLOPS?