• Phone FeatureAndroid Client
  • Phone FeatureiOS Client
  • Phone FeatureWeChat Mini Program
  • Phone FeatureH5 Page
  • 1
    Return

    Meta-analysis of large language models: benchmarking DeepSeek-R1 against ChatGPT, Gemini, Qwen, and LLaMA

    10.1186/s40537-025-01330-3
    2025-12-19
    0
    OA
    PDF
    AI
    Save
    Share
    Original
    View PDF
    Abstract

    Abstract

    En 中文
    The rapid evolution of large language models (LLMs), GPT-4 Turbo, Google Gemini, Qwen, Meta’s LLaMA 3.1, and DeepSeek-R1 has redefined the landscape of artificial intelligence. In the study, we conduct a hybrid meta-analysis integrating publicly available benchmarks, model cards, technical reports, and open-source repositories to evaluate LLMs across both performance and operational dimensions. Quantitative data were aggregated from standardized tasks such as MMLU (reasoning), HumanEval (code generation), FLORES-200 (translation), and TyDiQA (multilingual Q&A), complemented by efficiency metrics including FLOPs, GPU hours, inference latency, and subscription costs. A big data–driven KPI framework covering scalability index, data-throughput rate, energy per token, and training cost efficiency was applied to enable normalized, cross-model comparison. Results indicate that DeepSeek-R1 demonstrates strong coding and multilingual efficiency, ChatGPT-4 Turbo leads in reasoning accuracy, Gemini Ultra excels in multimodal inference, Qwen is competitive in Chinese-language tasks, and LLaMA 3.1 remains the most adaptable open-source option. Across datasets, DeepSeek-R1 achieved 80.2 ± 1.5% on HumanEval and 78.5 ± 1.8% on MMLU, compared with ChatGPT-4 Turbo’s 86.5 ± 1.9%; these gaps fall within observed heterogeneity (I2 = 14.6%). The findings highlight trade-offs among accuracy, scalability, and cost efficiency, emphasizing the need for transparent, sustainable, and multimodal LLM development.
    Keywords:
    DeepSeek
    LLM
    ChatGPT
    Gemini
    Qwen
    LLaMA
    AR
    Ethics
    AI Summary

    AI Summary

    Key information extracted from the uploaded paper, including a brief overview, abstract, background, key highlights, visual analysis, and future outlook.

    Journal

    Journal

    Journal of Big Data cover
    IF:
    6.4
    Papers: 1.3K
    Citations: 1.1W
    Researchers

    Researchers

    S
    Shafique Ahmed Awan
    H-index:
    0
    Papers: 1
    Citations: 0
    M
    Muazzam Ali Khan Khattak
    H-index:
    0
    Papers: 1
    Citations: 0
    A
    Abdullah Ayub Khan
    H-index:
    0
    Papers: 6
    Citations: 0
    A
    Anwar Ali Sathio
    H-index:
    0
    Papers: 1
    Citations: 0
    J
    Jamil Abedalrahim Jamil Alsayaydeh
    H-index:
    0
    Papers: 1
    Citations: 0
    Researchers View more
    Organization

    Organization

    D
    department of general education and foundation
    Scholars:
    1
    Papers: 1
    Citations: 0
    D
    department of cs
    Scholars:
    9
    Papers: 4
    Citations: 0
    F
    faculty of engineering
    Scholars:
    3.1K
    Papers: 1.7K
    Citations: 1
    D
    Department of Engineering Technology
    Scholars:
    12
    Papers: 9
    Citations: 0
    D
    Department of Computer Science
    Scholars:
    1.7K
    Papers: 998
    Citations: 8
    Cited Papers

    Cited Papers

    Citing Papers

    Citing Papers