Llama-3-8B-InstructOnline
System Initialized. I am Llama-3-8B-Instruct, loaded and ready for inference. You can adjust generation parameters like Temperature and Top-P from the settings menu. How can I assist you today?
A100 80GB 42 tok/s 340ms latency