bruno pfp
bruno

@brunostefoni

I'm in farhack and just finished setting up a Gradio server emulating openai API using locally hosted custom Llama 3 with vLLM. Awesome times
0 reply
0 recast
0 reaction