Do you have 32GB of free RAM? Group offloading with CUDA streams needs a lot of RAM, but it's fast. Anyway, it's really hard to give help here. If you still have problems, please open an issue in the diffusers repo with the code and the error you're getting.
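
If you want a quick way to check the free RAM before trying again, something like this rough sketch might help. It assumes Linux (it reads `/proc/meminfo`), the helper name `available_ram_gib` is just made up for illustration, and the 32 figure is only the number mentioned above, not a documented requirement.

```python
import os


def available_ram_gib(meminfo_text: str) -> float:
    """Return the MemAvailable value from /proc/meminfo-style text, in GiB."""
    for line in meminfo_text.splitlines():
        if line.startswith("MemAvailable:"):
            # The line looks like "MemAvailable:   33554432 kB" (value is in KiB).
            kib = int(line.split()[1])
            return kib / 1024**2
    raise ValueError("MemAvailable not found")


if __name__ == "__main__":
    path = "/proc/meminfo"  # assumption: Linux only
    if os.path.exists(path):
        with open(path) as f:
            free = available_ram_gib(f.read())
        # 32 is the rough figure from this thread, not an official threshold.
        print(f"Available RAM: {free:.1f} GiB (rough target: 32)")
    else:
        print("No /proc/meminfo here, check free RAM with your OS tools instead.")
```

If the number comes out well below 32, that could explain the problem, and it's worth including in the issue along with the code and the error.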