Visual token compression
16x compression is faster; 4x compression keeps more visual detail.
Examples