r PRO

oceansweep

AI & ML interests

None yet

Recent Activity

upvoted a paper about 3 hours ago

Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces

upvoted a paper about 3 hours ago

Qwen3-TTS Technical Report

upvoted a paper about 3 hours ago

Learning to Discover at Test Time

Organizations

None yet

upvoted 4 papers about 3 hours ago

Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces

Paper • 2601.11868 • Published 7 days ago • 17

liked 2 models about 15 hours ago

mlx-community/XortronCriminalComputingConfig-mlx-8Bit

Text Generation • Updated Jun 19, 2025 • 17 • 3

darkc0de/XortronCriminalComputingConfig

Text Generation • 24B • Updated Jul 7, 2025 • 470 • 105

upvoted a collection about 19 hours ago

Qwen3-TTS

Collection

7 items • Updated 1 day ago • 148

liked a model about 20 hours ago

mixedbread-ai/mxbai-rerank-large-v2

Text Ranking • 2B • Updated Aug 20, 2025 • 16.2k • 124

upvoted a paper 3 days ago

ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development

Paper • 2601.11077 • Published 8 days ago • 62

liked 3 models 4 days ago

KevinAHM/pocket-tts-onnx

Text-to-Speech • Updated 3 days ago • 7

mlx-community/GLM-4.7-Flash-8bit

Text Generation • 30B • Updated 4 days ago • 2.65k • 13

mlx-community/GLM-4.7-Flash-4bit

Text Generation • 30B • Updated 4 days ago • 2.95k • 39

liked a model 5 days ago

nvidia/personaplex-7b-v1

Audio-to-Audio • Updated about 21 hours ago • 17k • 694

upvoted a paper 6 days ago

Deriving Character Logic from Storyline as Codified Decision Trees

Paper • 2601.10080 • Published 9 days ago • 6

upvoted a paper 10 days ago

Lost in the Noise: How Reasoning Models Fail with Contextual Distractors

Paper • 2601.07226 • Published 12 days ago • 30

liked 3 models 10 days ago

YatharthS/NovaSR

Audio-to-Audio • Updated 4 days ago • 502 • 66

zai-org/GLM-Image

Text-to-Image • Updated 9 days ago • 11.7k • 966

Supertone/supertonic-2

Text-to-Speech • Updated 18 days ago • 18.3k • 324

upvoted a paper 14 days ago

Benchmark^2: Systematic Evaluation of LLM Benchmarks

Paper • 2601.03986 • Published 16 days ago • 34

upvoted a paper 16 days ago

OpenRT: An Open-Source Red Teaming Framework for Multimodal LLMs

Paper • 2601.01592 • Published 19 days ago • 12
