seekingamber
New member
Today, almost all SaaS/app wants to build integration with AI. A short story to share -
Fina Money uses LLM to power up its answer to users' financial questions. Initially using OpenAI's API. Observed the slow response on GPT-4 model, it makes me think, are there any alternatives that we may consider to balance the workload?
However, not all LLM models have the same quality to achieve the accuracy we want, this makes me test out a list of models available, and have a sense about what the landscape looks like regarding
Fina Money uses LLM to power up its answer to users' financial questions. Initially using OpenAI's API. Observed the slow response on GPT-4 model, it makes me think, are there any alternatives that we may consider to balance the workload?
However, not all LLM models have the same quality to achieve the accuracy we want, this makes me test out a list of models available, and have a sense about what the landscape looks like regarding
- Accuracy
- Speed
- gpt-4-turbo
- gpt-3.5-turbo
- llama3-8b-8192
- llama3-70b-8192
- gemma-7b-it
- mixtral-8x7b-32768