You may have to use the gpu_memory_limit and/or lora_on_cpu config options to stay away from running out of memory. If you continue to operate out of CUDA memory, you could try and merge in program RAM with
Posted https://prbookmarkingwebsites.com/story19901558/https-im-token-io-com-things-to-know-before-you-buy