Osilly's Collections

Vision-DeepResearch

updated 5 days ago

  • Osilly/Vision-DeepResearch-Toy-SFT-Data

    Viewer • Updated 6 days ago • 1k • 70

  • Osilly/Vision-DeepResearch-Toy-RL-Data

    Viewer • Updated 6 days ago • 1k • 29

  • Osilly/VDR-Bench

    Viewer • Updated 6 days ago • 2k • 46

  • Osilly/VDR-Bench-testmini

    Viewer • Updated 6 days ago • 500 • 21

  • Osilly/Vision-DeepResearch-8B

    9B • Updated 6 days ago • 25 • 3

  • Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

    Paper • 2601.22060 • Published 9 days ago • 147

  • Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models

    Paper • 2602.02185 • Published 6 days ago • 123