Who has the best deep research agent? Reading Summary
- Author: LI Tian
I have a few questions: 'Grok, ChatGPT o1 Pro, Gemini 2.0 Flash Thinking: which one is better? What are the criteria for evaluation? Is Deep Research at the model companies the same as Deep Research in the industry?'
Reading Resource:
https://www.youtube.com/watch?v=CF9IDZoznQY
Reference:
Deep Research Head2Head: Brutally Honest 5-Dimension Test
Q1: Research Report
GIVEN the following prompt,
Compare reasoning model API providers:
- DeepSeek R1 official API
- OpenRouter DeepSeek R1 API
- Gemini 2.0 flash thinking API
- o3-mini API
- Perplexity reasoner API
Compare their output format, whether they output thinking tokens, and whether they can stream them. You must include the API output format or relevant code snippet in your answer. Also link relevant document.
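To make concrete what "output thinking tokens" and "stream them" mean in this prompt, here is a minimal sketch of separating thinking tokens from answer tokens in a streamed response. It assumes OpenAI-style streamed chunks in which the thinking text arrives in a `reasoning_content` delta field, which is how I understand the DeepSeek R1 official API to behave; other providers may use a different field or hide thinking tokens entirely. The `split_stream` helper and the sample chunks are hypothetical, written by hand for illustration, and are not real API output.

```python
from typing import Iterable, Tuple


def split_stream(chunks: Iterable[dict]) -> Tuple[str, str]:
    """Accumulate thinking tokens and answer tokens from streamed chunks."""
    thinking, answer = [], []
    for chunk in chunks:
        for choice in chunk.get("choices", []):
            delta = choice.get("delta", {})
            # Assumption: thinking tokens arrive under "reasoning_content".
            if delta.get("reasoning_content"):
                thinking.append(delta["reasoning_content"])
            # Regular answer tokens arrive under "content".
            if delta.get("content"):
                answer.append(delta["content"])
    return "".join(thinking), "".join(answer)


# Hand-written sample chunks (hypothetical, not captured from any provider).
sample_chunks = [
    {"choices": [{"delta": {"reasoning_content": "Compare ", "content": None}}]},
    {"choices": [{"delta": {"reasoning_content": "formats.", "content": None}}]},
    {"choices": [{"delta": {"content": "Answer: "}}]},
    {"choices": [{"delta": {"content": "yes."}}]},
]

thinking_text, answer_text = split_stream(sample_chunks)
print("thinking:", thinking_text)
print("answer:", answer_text)
```

A provider that does not expose thinking tokens would simply leave `thinking_text` empty under this sketch, which is one of the differences the prompt asks the agents to report.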
Conclusion




Q2: Technical Question
GIVEN the following prompt,
Survey the best Client-side JS library for hybrid search (full text and vector search combined), rank them by popularity, reliability, memory footprint, and ease of use. Present any performance benchmarks, release activity and references to documentation or community support that provide insight into their overall stability and adoption.
TEST the performance of ChatGPT, Gemini and Perplexity Deep Research.
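For readers unfamiliar with the term "hybrid search" in the prompt, the sketch below shows one common way to combine a full-text ranking with a vector-search ranking: reciprocal rank fusion. This is my own illustration, not something taken from the video or from any of the libraries being surveyed. It is written in Python for brevity even though the prompt targets client-side JS; the document IDs and the constant `k = 60` are assumptions chosen for the example.

```python
from collections import defaultdict
from typing import Dict, List


def reciprocal_rank_fusion(rankings: List[List[str]], k: int = 60) -> List[str]:
    """Fuse several ranked lists of document IDs into one ranking."""
    scores: Dict[str, float] = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            # Each list contributes 1 / (k + rank) for every document it returns.
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties are broken by document ID.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical results from the two retrieval methods.
full_text_hits = ["doc_a", "doc_b", "doc_c"]
vector_hits = ["doc_c", "doc_a", "doc_d"]

fused = reciprocal_rank_fusion([full_text_hits, vector_hits])
print(fused)
```

Documents that appear near the top of both lists rise to the top of the fused list, which is the behaviour a hybrid search library is expected to provide in some form.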
Conclusion




Benchmark of Deep Research



