Coloring is done with ANSI escape codes. The only supported mode is 24-bit TrueColor. Frame rate highly depends on terminal printing speed. The one I use is Alacritty. It gives me about 38 FPS for ...
#define CK(x) do{cudaError_t e=(x);if(e){fprintf(stderr,"CUDA %s:%d %s\n",__FILE__,__LINE__,cudaGetErrorString(e));exit(1);}}while(0) __device__ __inline__ int min ...