1 parent e9eb4ba commit e4431f9
content/posts/meetup-56-wrapup.md
@@ -71,7 +71,7 @@ and an [AMD AI MAX+ 395 with an
| FP16 (theoretical) | 59.4 TFLOPS | ~19.2 TFLOPS |
| Memory Bandwidth | ~212 GB/s (DDR5-8000) | 280 GB/s (GDDR6) |
-However, *[prefil](https://huggingface.co/blog/tngtech/llm-performance-prefill-decode-concurrent-requests)l* is a bit faster on the nvidia card:
+However, *[prefill](https://huggingface.co/blog/tngtech/llm-performance-prefill-decode-concurrent-requests)* is a bit faster on the nvidia card:
```
$ time OLLAMA_MODEL=qwen3:14b OLLAMA_HOST=http://ada:11434 ./one -m "how warm is it in leipzig?"