Hi everyone, I’m currently running an AMD 7090 on Bluefin with developer mode enabled. In my testing I’m getting 10-15 tokens per second, but from what I’ve read, 15-20 tokens per second is better for coding. Is there anything I can do to reach that speed? I can provide more info if needed. Thanks in advance!