jasonjiang

mikinyaa

jasonjiang8866

AI & ML interests

None yet

Organizations

None yet

upvoted a paper about 24 hours ago

Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs

Paper • 2601.17058 • Published 7 days ago • 143

upvoted a paper 5 days ago

STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published 15 days ago • 189

upvoted a paper 6 days ago

Agentic Reasoning for Large Language Models

Paper • 2601.12538 • Published 11 days ago • 182

upvoted a paper 7 days ago

Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey

Paper • 2601.11655 • Published 14 days ago • 60

upvoted a paper 8 days ago

ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development

Paper • 2601.11077 • Published 13 days ago • 64

upvoted a paper 9 days ago

Your Group-Relative Advantage Is Biased

Paper • 2601.08521 • Published 16 days ago • 146

upvoted a paper 12 days ago

Aligning Text, Code, and Vision: A Multi-Objective Reinforcement Learning Framework for Text-to-Visualization

Paper • 2601.04582 • Published 21 days ago • 10

upvoted 2 papers 13 days ago

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation

Paper • 2601.09688 • Published 15 days ago • 125

Controlled Self-Evolution for Algorithmic Code Optimization

Paper • 2601.07348 • Published 17 days ago • 113

upvoted 3 papers 14 days ago

Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking

Paper • 2601.04720 • Published 21 days ago • 51

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published 21 days ago • 211

MemGovern: Enhancing Code Agents through Learning from Governed Human Experiences

Paper • 2601.06789 • Published 18 days ago • 78

upvoted a paper 15 days ago

Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning

Paper • 2601.06943 • Published 18 days ago • 208

upvoted a paper 16 days ago

Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization

Paper • 2601.05432 • Published 20 days ago • 164

upvoted a paper 19 days ago

FantasyWorld: Geometry-Consistent World Modeling via Unified Video and 3D Prediction

Paper • 2509.21657 • Published Sep 25, 2025 • 4

upvoted a collection 19 days ago

Qwen3-VL-Embedding

Collection

2 items • Updated 21 days ago • 57

upvoted 2 papers 20 days ago

Improving Multi-step RAG with Hypergraph-based Memory for Long-Context Complex Relational Modeling

Paper • 2512.23959 • Published about 1 month ago • 110

Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting

Paper • 2601.02151 • Published 24 days ago • 104

upvoted a paper 27 days ago

Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models

Paper • 2512.24618 • Published 29 days ago • 143

upvoted a paper about 1 month ago

Beyond Memorization: A Multi-Modal Ordinal Regression Benchmark to Expose Popularity Bias in Vision-Language Models

Paper • 2512.21337 • Published Dec 24, 2025 • 31

mikinyaa's activity