TIL from GPT (ongoing) | Notion

The Benefits of `mlock` and `mmap` (when using llama.cpp)

what does mlock do? I'm running a C++ app that accepts --mlock argument. What's the benefit of this?

All about GPUs and Apple Silicon

What are these things?

GPU cores
GPU bandwidth
VRAM
Swapping
GPU benchmarks
Apple Silicon
Metal vs. MPS
Batch size
NVLink (removed in 4090, available in 3090)
GPU chaining

tell me about GPUs, Apple’s unified memory approach, bandwidth, cores, and other related topics.

How Tailscale reverse proxy works

Also, what is the difference between a proxy and a “reverse” proxy?

I'm using tailscale serve. I ran a server command on my server machine which runs at localhost and port 8080. I then used the following tailscale command to make this address and port available to my

How GBNF works (context-free grammar used in llama.cpp)

The meaning of ws = [ \\t\\n]*