The Benefits of mlock and mmap (when using llama.cpp)
What does mlock do? I'm running a C++ app that accepts a --mlock
argument. What's the benefit of this?
All about GPUs and Apple Silicon
What are these things?
- GPU cores
- GPU bandwidth
- VRAM
- Swapping
- GPU benchmarks
- Apple Silicon
- Metal vs. MPS
- Batch size
- NVLink (removed in the RTX 4090, available in the RTX 3090)
- GPU chaining
Tell me about GPUs, Apple’s unified memory approach, bandwidth, cores, and other related topics.
How Tailscale reverse proxy works
- Also, what is the difference between a proxy and a “reverse” proxy?
I'm using tailscale serve. I ran a server command on my server machine, and the server listens on localhost at port 8080. I then used the following tailscale command to make this address and port available to my
How GBNF works (context-free grammar used in llama.cpp)
- The meaning of ws = [ \\t\\n]*
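To make the rule concrete: in a GBNF grammar file it would normally be written `ws ::= [ \t\n]*`, which matches zero or more characters that are a space, a tab, or a newline. The doubled backslashes above are assumed to come from the grammar being embedded in a string literal. The sketch below is not the llama.cpp grammar engine; `match_ws` is a hypothetical helper that only shows what text the rule accepts.

```cpp
#include <cstddef>
#include <string>

// Hypothetical helper: consume as many characters as the rule [ \t\n]* allows,
// starting at `pos`, and return the index of the first character not consumed.
// Because of the `*`, matching zero characters is also a valid match.
std::size_t match_ws(const std::string& input, std::size_t pos) {
    while (pos < input.size() &&
           (input[pos] == ' ' || input[pos] == '\t' || input[pos] == '\n')) {
        ++pos;
    }
    return pos;
}
```

For example, `match_ws("  \t\nabc", 0)` consumes the four leading whitespace characters, while `match_ws("abc", 0)` consumes nothing and still counts as a match. That is why grammars can place `ws` between tokens without making whitespace mandatory.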