Faster Ollama alternative

RandomlyRight@sh.itjust.works · 2 days ago

Faster Ollama alternative

Arehandoro@lemmy.ml · 1 day ago

I don’t think it’s OpenAI compatible, but deepseek is faster.

hendrik@palaver.p3x.de · 1 day ago

Btw, Ollama is a software to run AI models. Deepseek is just a company. Or a model file or a service. But that’s not what OP is looking for. They want to run a model. And that needs software like Ollama.

Arehandoro@lemmy.ml · 4 hours ago

Isn’t this a model? https://github.com/deepseek-ai/DeepSeek-V3

(Honest question, not an expert in AI)

hendrik@palaver.p3x.de · edit-2 3 hours ago

Yes, Deepseek V3 is a model. But what I was trying to say, you download the file. But then what? Just having the file stored on your harddisk doesn’t do much. You need to run it. That’s called “inference” in machine learning/AI terms. The repository you linked, contains some example code how to do it with Huggingface’s Transformer library. But there are quite some frameworks out there for running AI models. Ollama would be another one. And it’s not just some example code where to start with your own Python program, but a ready-made project/framework with tools and frontends available and an interface for other software to hook into.

And generally, you need some software to actually do something. And how fast it is, depends on the software used, the hardware it’s executed on. And in this case, also on the size of the AI model and its architecture. But yeah, Deepseek V3 has some tricks up it’s sleeves to make it very efficient. Though, it is really big for home use. I think we’re looking at a six-figure price for the hardware to run it. Usually, people use Deepseek R1 models. Or other smaller AI models if they run them themselves.