DeepSeek AI Models 8 min read

DeepSeek Guide 2025: R1 Model, Chat & API Setup

DeepSeek is a Chinese AI lab that released two landmark open-source models — DeepSeek V3 (general-purpose) and DeepSeek R1 (reasoning) — both under the MIT license. This guide covers the free chat interface, running R1 locally with Ollama, the OpenAI-compatible API, Python examples, and how DeepSeek R1 compares to OpenAI o1.

1. What is DeepSeek?

DeepSeek is a Chinese AI research lab (founded 2023, backed by High-Flyer hedge fund) that develops and open-sources large language models. Their key breakthrough: achieving GPT-4o and o1-level performance at a fraction of the training cost, then releasing the weights for free under the MIT license.

Why DeepSeek matters

MIT license — use commercially for free, modify, deploy anywhere
Open weights — run locally on your own hardware via Ollama or vLLM
Cheapest frontier-class API — $0.14/1M input tokens for V3 (vs $5/1M for GPT-4o)
OpenAI-compatible API — drop-in replacement, just change base_url
Reasoning model (R1) — chain-of-thought reasoning competitive with OpenAI o1

Privacy note: DeepSeek stores conversation data on servers in China. For sensitive data (business secrets, personal info, regulated industries), use the API and review their privacy policy, or run DeepSeek R1 locally with Ollama where data never leaves your machine.

2. DeepSeek R1 vs V3: Which to Use?

DeepSeek has two flagship models with very different purposes. Choosing the right one makes a big difference in quality and cost.

Feature DeepSeek R1 DeepSeek V3
Type Reasoning model General-purpose
Best for Math, logic, complex coding, scientific reasoning Writing, chat, summarization, coding, Q&A
API model ID deepseek-reasoner deepseek-chat
API price (input) $0.55/1M tokens $0.14/1M tokens
API price (output) $2.19/1M tokens $0.28/1M tokens
Response time Slower (generates reasoning chain first) Faster
Ollama command ollama run deepseek-r1 ollama run deepseek-v3
Comparable to OpenAI o1 GPT-4o

When to pick R1 vs V3

Use R1 when:

  • Solving math problems step by step
  • Debugging complex code logic
  • Scientific reasoning or proofs
  • Multi-step planning tasks

Use V3 when:

  • Writing, summarizing, translating
  • General Q&A and chat
  • Building cost-efficient apps
  • Fast responses needed

3. Using DeepSeek Chat (Free)

The easiest way to try DeepSeek — no account, no API key, no credit card. Just open the browser and start chatting.

1

Go to chat.deepseek.com

Navigate to chat.deepseek.com in your browser. No login required for basic usage.

2

Select DeepSeek V3 or R1

The model selector is at the top of the chat. Choose DeepSeek V3 for general tasks or DeepSeek R1 (labeled “DeepThink”) for math and reasoning. R1 shows its thinking process visibly.

3

Type your message and send

DeepSeek V3 responds quickly. R1 (DeepThink) first generates a visible reasoning chain, then the answer — this is normal and usually takes 10–30 seconds for complex problems.

Limitations of the free chat

Rate limits during peak hours (mostly evenings UTC+8)
Data stored on Chinese servers — don't use for sensitive info
No API access, file uploads, or plugins in free mode
Context window limited in the web interface

4. DeepSeek R1 on Ollama (Local, Free)

Running DeepSeek R1 locally with Ollama gives you complete privacy — no data leaves your machine. It's free after the initial download (the weights are MIT licensed). You'll need at least 8GB RAM for the 7B model, 32GB+ for 70B.

1

Install Ollama from ollama.com

Download and install Ollama for your OS (Mac, Windows, Linux). Runs as a background service.

2

Pull and run DeepSeek R1

ollama run deepseek-r1

This downloads the 7B model by default (~4.7GB). For larger variants: ollama run deepseek-r1:14b, ollama run deepseek-r1:32b, or ollama run deepseek-r1:70b.

3

Chat in the terminal

Ollama starts an interactive session. R1 shows its thinking process inline — you'll see it reason through problems step by step before giving the final answer. Type /bye to exit.

Model sizes and RAM requirements

Ollama command Size Min RAM Notes
ollama run deepseek-r1 7B 8GB Fastest, runs on most machines
ollama run deepseek-r1:14b 14B 16GB Better reasoning quality
ollama run deepseek-r1:32b 32B 32GB Near API-quality local
ollama run deepseek-r1:70b 70B 64GB Full R1 capability

5. DeepSeek API Setup

The DeepSeek API is fully OpenAI-compatible — same request/response format, same SDK. You only need to change two things: the base URL and the model name.

1

Create account at platform.deepseek.com

Sign up at platform.deepseek.com and add API credits. New accounts may receive free trial credits.

2

Generate an API key

In the platform dashboard, go to API Keys and click Create. Store the key securely — it won't be shown again.

3

Set as environment variable

export DEEPSEEK_API_KEY="sk-..."

API pricing

Model Input (cached) Input (non-cached) Output
deepseek-chat (V3) $0.014/1M $0.14/1M $0.28/1M
deepseek-reasoner (R1) $0.055/1M $0.55/1M $2.19/1M

Prices per million tokens. Context caching (when the same prefix is reused) gives a 10× input cost reduction.

6. Python Code Examples

The DeepSeek API uses the OpenAI Python SDK — just set a different base_url. Install with pip install openai.

DeepSeek V3 (general chat)

import openai

client = openai.OpenAI(
    api_key="YOUR_DEEPSEEK_API_KEY",
    base_url="https://api.deepseek.com"
)

response = client.chat.completions.create(
    model="deepseek-chat",  # DeepSeek V3
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Explain quantum entanglement simply."}
    ]
)
print(response.choices[0].message.content)

DeepSeek R1 (reasoning with thinking output)

# Use DeepSeek R1 for reasoning tasks
response = client.chat.completions.create(
    model="deepseek-reasoner",  # DeepSeek R1
    messages=[
        {"role": "user", "content": "Solve: If a train travels at 80 mph for 2.5 hours, then 60 mph for 1.5 hours, what is the average speed for the whole trip?"}
    ]
)
# R1 returns a reasoning_content field with the chain-of-thought
print(response.choices[0].message.reasoning_content)
print(response.choices[0].message.content)

R1 returns a reasoning_content field containing the chain-of-thought, plus content with the final answer.

7. DeepSeek R1 vs OpenAI o1: Comparison

DeepSeek R1 directly competes with OpenAI o1 as a reasoning model. On most benchmarks, they perform similarly — but R1 costs 1/30th the price.

Metric DeepSeek R1 OpenAI o1
Input price (1M tokens) $0.55 $15
Output price (1M tokens) $2.19 $60
AIME 2024 (math) 79.8% 74.4%
GPQA (science) 71.5% 78.0%
SWE-bench (coding) 49.2% 48.9%
Open source? Yes (MIT) No (closed)
Run locally? Yes (Ollama) No
Data location China (API) / Local (Ollama) USA

Verdict: For most reasoning tasks, DeepSeek R1 is an excellent OpenAI o1 replacement at 1/30th the cost. Use o1 if you need maximum reliability, US data residency, or GPT-4o-compatible multimodal features (o1 supports images, R1 does not).

8. Privacy Considerations

DeepSeek Chat (chat.deepseek.com)

All conversations are stored on servers in China. Subject to Chinese data laws. Do not use for: confidential business data, legal documents, personal health information, trade secrets, or anything regulated under GDPR/HIPAA/SOC 2.

DeepSeek API (api.deepseek.com)

API requests are also processed on Chinese infrastructure. Better than the chat interface (no conversation history stored by default), but still subject to the same data jurisdiction. Review platform.deepseek.com/privacy before using for sensitive applications.

✅ Safest option: Run locally with Ollama

Running ollama run deepseek-r1 means model weights are on your machine and all inference is done locally. Zero data leaves your computer — the model never sends requests anywhere. This is the recommended approach for sensitive data.

🔔

Monitor DeepSeek API status at Prismix

DeepSeek can experience capacity issues during peak hours. Monitor live status at Prismix and get free email alerts the moment it goes down — before your users notice.

FAQ

What is DeepSeek?

DeepSeek is a Chinese AI research lab that develops open-source large language models. Their main models are DeepSeek V3 (general-purpose) and DeepSeek R1 (reasoning-focused). Both are released under the MIT license, meaning they can be used commercially for free. DeepSeek's API is OpenAI-compatible, making it easy to switch from OpenAI at a fraction of the cost.

Is DeepSeek R1 better than OpenAI o1?

DeepSeek R1 is competitive with OpenAI o1 on reasoning benchmarks (AIME math, GPQA science, SWE-bench coding) at roughly 1/30th the API cost: $0.55/1M input tokens vs $15/1M for o1. R1 uses reinforcement learning to develop a chain-of-thought reasoning process similar to o1. For most coding and math tasks, R1 is an excellent and much cheaper alternative to o1.

Is DeepSeek free to use?

DeepSeek Chat (chat.deepseek.com) is free to use with no account required. The DeepSeek API costs $0.14/1M input tokens for V3 and $0.55/1M for R1 — among the cheapest in the industry. You can also run DeepSeek R1 locally for free using Ollama. The model weights themselves are open-source under MIT license.

Is DeepSeek safe to use for sensitive data?

DeepSeek stores data on servers in China, which may be subject to Chinese government data access laws. For sensitive business, legal, or personal data, use the DeepSeek API (rather than the chat interface) and check their privacy policy, or run DeepSeek R1 locally via Ollama where no data leaves your machine.