Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Brand Logo

agnos.is Forums

  1. Home
  2. LocalLLaMA
  3. DeepSeek R2 AI Model Rumors Begin to Swirl Online; Reported to Feature 97% Lower Costs Compared to GPT-4 & Fully Trained on Huawei's Ascend Chips

DeepSeek R2 AI Model Rumors Begin to Swirl Online; Reported to Feature 97% Lower Costs Compared to GPT-4 & Fully Trained on Huawei's Ascend Chips

Scheduled Pinned Locked Moved LocalLLaMA
localllama
2 Posts 2 Posters 2 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • C This user is from outside of this forum
    C This user is from outside of this forum
    [email protected]
    wrote on last edited by
    #1
    This post did not contain any content.
    B 1 Reply Last reply
    1
    56
    • C [email protected]
      This post did not contain any content.
      B This user is from outside of this forum
      B This user is from outside of this forum
      [email protected]
      wrote on last edited by
      #2

      1.2T param, 78B active, hybrid MoE

      That's enormous, very much not local, heh.

      Here's the actual article translation (which seems right comparing to other translations):

      ::: spoiler Translation
      DeepSeek R2: Unit Cost Drops 97.3%, Imminent Release + Core Specifications

      Author: Chasing Trends Observer
      Veteran Crypto Investor Watching from Afar
      2025-04-25 12:06:16 Sichuan

      Three Core Technological Breakthroughs of DeepSeek R2:

      1. Architectural Innovation
        Adopts proprietary Hybrid MoE 3.0 architecture, achieving 1.2 trillion dynamically activated parameters (actual computational consumption: 78 billion parameters).
        Validated by Alibaba Cloud tests:
      • 97.3% reduction in per-token cost compared to GPT-4 Turbo for long-text inference tasks
        (Data source: IDC Computing Power Economic Model)
      1. Data Engineering
        Constructed 5.2PB high-quality corpus covering finance, law, patents, and vertical domains.
        Multi-stage semantic distillation boosts instruction compliance accuracy to 89.7%
        (Benchmark: C-Eval 2.0 test set)

      2. Hardware Optimization
        Proprietary distributed training framework achieves:

      • 82% utilization rate on Ascend 910B chip clusters
      • 512 PetaFLOPS actual computing power at FP16 precision
      • 91% efficiency of equivalent-scale A100 clusters
        (Validated by Huawei Labs)

      Application Layer Advancements - Three Multimodal Breakthroughs:

      1. Vision Understanding
        ViT-Transformer hybrid architecture achieves:
      • 92.4 mAP on COCO dataset object segmentation
      • 11.6% improvement over CLIP models
      1. Industrial Inspection
        Adaptive feature fusion algorithm reduces false detection rate to 7.2E-6 in photovoltaic EL defect detection
        (Field data from LONGi Green Energy production lines)

      2. Medical Diagnostics
        Knowledge graph-enhanced chest X-ray multi-disease recognition:

      • 98.1% accuracy vs. 96.3% average of senior radiologist panels
        (Blind test results from Peking Union Medical College Hospital)

      Key Highlight:
      8-bit quantization compression achieves:

      • 83% model size reduction
      • <2% accuracy loss
        (Enables edge device deployment - Technical White Paper Chapter 4.2)
        :::

      Others translate it as 'sub-8-bit' quantization, which is interesting too.

      1 Reply Last reply
      6
      • System shared this topic on
      Reply
      • Reply as topic
      Log in to reply
      • Oldest to Newest
      • Newest to Oldest
      • Most Votes


      • Login

      • Login or register to search.
      • First post
        Last post
      0
      • Categories
      • Recent
      • Tags
      • Popular
      • World
      • Users
      • Groups