use long to avoid overflow in num_threads
Some newer GPUs have up to 40GB of memory, and with buffers that large some of the int variables that store the number of threads overflow. This change switches those variables to long to avoid the overflow.
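A minimal sketch of the arithmetic behind the overflow, not the code touched by this change. It assumes a workload that launches one thread per 4-byte element of a 40 GiB buffer; the helper names `threads_for_buffer` and `fits_in_int` are hypothetical. The sketch uses `std::int64_t` to make the width explicit, since `long` is 64 bits on LP64 platforms such as Linux but only 32 bits on 64-bit Windows.

```cpp
#include <cstdint>
#include <limits>

// Hypothetical helper: thread count when one thread is launched per element.
// The division is done in 64 bits, so the result cannot wrap.
constexpr std::int64_t threads_for_buffer(std::int64_t buffer_bytes,
                                          std::int64_t bytes_per_element) {
    return buffer_bytes / bytes_per_element;
}

// True when the count can be stored in an int without overflow.
constexpr bool fits_in_int(std::int64_t count) {
    return count >= std::numeric_limits<int>::min() &&
           count <= std::numeric_limits<int>::max();
}

// Assumed workload: 40 GiB of 4-byte floats, one thread per element.
constexpr std::int64_t kBufferBytes = 40LL * 1024 * 1024 * 1024;
constexpr std::int64_t kNumThreads = threads_for_buffer(kBufferBytes, 4);

// 42,949,672,960 bytes / 4 = 10,737,418,240 threads.
static_assert(kNumThreads == 10737418240LL, "unexpected thread count");
// An int counter cannot represent this value; a 64-bit type can.
static_assert(!fits_in_int(kNumThreads), "an int counter would overflow here");
```

Under these assumptions the thread count is roughly five times the largest value a 32-bit int can hold, which is why the counters need a wider type.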