AI & ML interests

Hardware-aware AI Model Optimization

Recent Activity

Nota AI Banner

Nota AI bridges the gap between high-performance AI models and edge devices.
From our automated optimization platform to bespoke AI solutions, we ensure your AI functions efficiently—everywhere it is needed.

Website LinkedIn NetsPresso


🌟 Spotlight

World Best LLM (WBL) Project

Nota AI participates in the 'World Best LLM' (WBL) project, a key initiative by the South Korean government (NIPA) to develop global-tier foundation models. As a core optimization partner, we focus on compressing massive LLMs for practical deployment.

🔥 New Release: Qwen3-30B-A3B-NotaMoEQuant-Int4

4-bit Quantization for Mixture-of-Experts (MoE)

This model demonstrates our proprietary NotaMoEQuant technology applied to the Qwen3-30B architecture.

  • Optimization Tech: NotaMoEQuant (Int4 Quantization for Active Parameters).
  • Key Benefit: Significantly reduces memory bandwidth requirements while maintaining reasoning capabilities of the 30B MoE model.
  • Target: Efficient inference on consumer-grade GPUs and edge servers.

🚀 Our Core Business

🛠️ AI Platform: NetsPresso

"We make AI lighter, faster, and ready for deployment."

NetsPresso is our proprietary platform that accelerates model optimization, enabling you to secure on-device latency and accuracy without deep hardware expertise.

  • Develop & Compress: Create lightweight models effortlessly using our Model Zoo and advanced Compressor (Structured Pruning).
  • Optimize & Convert: Maximize speed on verified hardware (NVIDIA, Arm, Qualcomm, etc.) with Graph Optimization and Graph Quantization.
  • Test on Real Devices: Validate performance instantly on actual devices via our Device Farm to eliminate deployment failures.

👉 Ready to optimize? Try NetsPresso Now | View Documentation

🌍 AI Solutions

"We provide end-to-end AI solutions powered by our core optimization technology."

1. Nota Vision Agent

Powered by Vision Language Models (VLM), this agent goes beyond simple detection to understand complex situational contexts.
It interprets video feeds through natural language prompts, delivering real-time insights locally without cloud dependency.

2. Edge AI Solutions

📚 Tech Blog

Gain insights into our engineering philosophy. We share deep dives into model compression methodologies and NPU acceleration techniques to help you stay ahead.

👉 Read our Tech Blog

🔗 Connect with Us


© 2026 Nota Inc. All rights reserved.