This should be impossible on a 4GB GPU with 80GB RAM, but there you have it. MPDOK at 8x faster than SciPy (15x when N=128K).
No download links available.