We Built a Music App with ACE-Step – Looking for Feedback
Hey everyone,
We've been building AceSteps, a platform where anyone can create music using the ACE-Step model (ACE-Step/ACE-Step-v1-3.5B). You can mint your tracks as NFTs, tokenize them into 100,000 fractional shares, and trade them on Uniswap V4. When your song gets popular, token holders earn from ad revenue automatically. It's a Farcaster Mini-App on Base Network.
But we want to make it better, and we'd love your input:
What's the one feature that would make you actually use an AI music tool regularly? And do you have any suggestions for how we can make this model better? We're mainly sharing here to ask these questions.
Introducing LoongFlow: A Thinking & Learning Framework for Expert-Grade AI Agents
Unlike traditional evolutionary agents (OpenEvolve-style), LoongFlow implements the PES (Plan-Execute-Summary) paradigm to learn from mistakes and avoid local optima.
Highlights:
- SOTA: surpassed human mathematicians on 11 geometry/algebra problems.
- 23 Kaggle gold medals on MLE Bench.
- Efficiency: 60% more efficient than current baselines.
The latest beta of Voiden - the API client built to treat API work like code - is now available.
This is a significant release: it addresses specific feedback we received, expands some core capabilities and plugins, and improves Voiden's overall performance at scale.
What's new:
- GraphQL support: you can now work with GraphQL APIs side by side with REST, gRPC, and WSS, using the same file-based, version-controlled workflow on Linux, Windows, and macOS.
- gRPC and WSS on Windows: full gRPC and WSS support is now available on Windows, bringing feature parity across platforms.
- Faster performance for large OpenAPI specs: opening large OpenAPI files is now significantly faster. We fixed inefficient re-renders that weren't noticeable in small specs but caused lag with heavy schemas; rendering is now optimized using React hooks to avoid unnecessary updates.
Additional improvements:
- Voiden now handles imperfect specs more gracefully
- Project uninstalling and support for setting a default directory for project creation
- .env files are now editable
- Improved text contrast for error messages
We've open-sourced a bilingual Semantic Highlighting model that can power multiple production scenarios:
1) RAG Answer Highlighting: automatically highlight the exact sentences that answer user queries, improving interpretability and helping users quickly locate relevant information.
2) RAG Noise Filtering: prune irrelevant context before sending it to LLMs, achieving a 70-80% token cost reduction while improving answer quality by letting the model focus on what matters.
3) Search System Highlighting: add semantic highlighting to recommendation systems, e-commerce search, or any retrieval system where users need to see why a result is relevant.
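The highlighting and noise-filtering pipelines above can be sketched as follows. This is a minimal illustration that scores each sentence against the query and keeps only the relevant ones; the toy lexical-overlap scorer is a stand-in for the actual semantic model, whose loading code lives on its model card.

```python
import re

def score(query, sentence):
    # Toy relevance score: Jaccard overlap of lowercase word sets.
    # In production, this is where the semantic highlighting model's
    # query-sentence score would go.
    q = set(re.findall(r"\w+", query.lower()))
    s = set(re.findall(r"\w+", sentence.lower()))
    return len(q & s) / len(q | s) if q | s else 0.0

def highlight(query, passage, threshold=0.2):
    # Split the passage into sentences and keep only those that
    # clear the relevance threshold (both the "highlight" and the
    # "prune before the LLM" scenario use this same selection step).
    sentences = re.split(r"(?<=[.!?])\s+", passage)
    return [s for s in sentences if score(query, s) >= threshold]

passage = ("The model has 1.8B parameters. Paris is the capital of France. "
           "It was trained on bilingual data.")
filtered = highlight("How many parameters does the model have?", passage)
print(filtered)
```

For the noise-filtering use case, the pruned list is simply joined back into the context string before it is sent to the LLM.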
Designing an acquisition agent around intent and constraints
We recently shared how we built an acquisition agent for GoDaddy Auctions, and one thing stood out: autonomy is easy to add; intent is not.
Rather than optimizing for agent capability, the design centered on:
- making user intent explicit and machine-actionable
- defining clear constraints on when and how the agent can act
- integrating tightly with existing systems, data, and trust boundaries
In our experience, this framing matters more than model choice once agents move into production environments.
The article describes how we approached this and what we learned when intent and constraints became core architectural inputs.
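The idea of treating intent and constraints as explicit architectural inputs can be sketched like this. All names here (Intent, Constraints, can_act, the field names) are hypothetical illustrations, not GoDaddy's actual code.

```python
from dataclasses import dataclass, field

@dataclass
class Intent:
    """User intent made explicit and machine-actionable."""
    goal: str                       # what the user wants, in plain terms
    max_bid_usd: float              # a concrete, enforceable budget
    keywords: list = field(default_factory=list)

@dataclass
class Constraints:
    """Non-negotiable limits on when and how the agent may act."""
    allowed_actions: set
    autonomy_ceiling_usd: float     # above this, a human must approve

def can_act(action, amount_usd, constraints):
    # Constraints are checked before any capability is exercised;
    # capability alone never authorizes an action.
    if action not in constraints.allowed_actions:
        return False
    return amount_usd <= constraints.autonomy_ceiling_usd

intent = Intent(goal="acquire short .com domains",
                max_bid_usd=500.0, keywords=["ai", "agent"])
rules = Constraints(allowed_actions={"watch", "bid"},
                    autonomy_ceiling_usd=250.0)
print(can_act("bid", 100.0, rules))   # within autonomy bounds
print(can_act("bid", 400.0, rules))   # exceeds ceiling: escalate to a human
```

The point of the sketch is the ordering: the agent's action space is derived from intent and filtered by constraints, rather than constraints being bolted on after capability.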
HY-MT1.5-1.8B: A Lightweight, Open-Source Translation Model Game-Changer
Tencent raised the bar for lightweight translation!
Supports bidirectional translation across 36 languages in total: 33 mainstream languages plus 5 ethnic/minority dialects.
With only 1.8B parameters (less than 1/3 the size of HY-MT1.5-7B), it delivers performance on par with the 7B counterpart and outperforms most commercial translation APIs.
- Quantized versions (FP8/GPTQ-Int4) available for edge device deployment, perfect for real-time translation
- Full support for terminology intervention, context-aware translation, and formatted output
- Ready-to-use prompt templates + seamless integration with Hugging Face Transformers
- Recommended transformers >= 4.56.0 (the FP8 model requires compressed-tensors 0.11.0)
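A minimal sketch of how the Transformers integration might look. The repo id and the prompt wording here are assumptions for illustration; the model card ships the actual ready-to-use templates, which should be preferred in real use.

```python
# Hypothetical repo id; check the model card for the exact name.
MODEL_ID = "tencent/HY-MT1.5-1.8B"

def build_prompt(text, src="English", tgt="Chinese"):
    # Illustrative instruction-style prompt; substitute the official
    # template from the model card in production.
    return f"Translate the following {src} text into {tgt}:\n\n{text}"

prompt = build_prompt("Hello, world!")
print(prompt)

# Uncomment to run the actual model (requires transformers >= 4.56.0
# and downloads the 1.8B weights):
# from transformers import pipeline
# translator = pipeline("text-generation", model=MODEL_ID)
# print(translator(prompt, max_new_tokens=128)[0]["generated_text"])
```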
10+ Hugging Face Spaces have already integrated this model!
Hey Hugging Face! I just wanted to share something I've been working on lately. This is Continuum, an app that started as a regular chat interface but quickly spiraled into much more!
The left panel contains settings, different project workspaces with associated chat sessions, and the model drop down menu.
The middle panel is the chat window, with distinct color schemes for italic and bold text.
The right panel is the "Loom" - a collaborative document workspace for the AI model and the user to work together in markdown with a live preview toggle switch.
The Loom supports differential edits, allowing the user to reject, approve, or edit each model change/addition. Right now, Continuum supports BYOK, OpenAI-compatible endpoints, and local models served through Ollama/llama.cpp.
It's still very much a work in progress but I'm really happy with how it's coming along so far. I'm excited to share this demo with all of you when it's ready!
I built a crazy ultra-low latency voice assistant agent using Pipecat, NVIDIA Riva, NVIDIA NIM, and an MCP-powered tool stack. It can talk in real time, search the web, and manage your project files and documentation hands-free (create, read, summarise, document, and clean up).
Voiden makes it easy to work with APIs that use API Key Authentication by giving you a clean and organized way to attach API keys to every request.
API keys are a common way for APIs to identify and authorize clients. With Voiden, every request sent to your workspace APIs automatically includes the correct API key, so the API provider always knows who's calling and whether the request is allowed.
Each client or service uses a unique API key, acting as a secure identifier attached to every request.
How it works in Voiden:
When you configure API Key Authentication, you simply:
- Choose where the API key is sent (header, query, or cookie)
- Define the key name
- Provide the API key value
That's it. Voiden takes care of the rest by automatically attaching the API key to every request in your workspace.
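Conceptually, the three placements work like this. This is an illustrative sketch of API-key attachment in general, not Voiden's internal code; the function and field names are hypothetical.

```python
def attach_api_key(request, location, key_name, key_value):
    # Attach an API key to a request description in one of the three
    # standard placements: HTTP header, URL query parameter, or cookie.
    if location == "header":
        request.setdefault("headers", {})[key_name] = key_value
    elif location == "query":
        request.setdefault("params", {})[key_name] = key_value
    elif location == "cookie":
        request.setdefault("cookies", {})[key_name] = key_value
    else:
        raise ValueError(f"unknown location: {location}")
    return request

# Example: key sent as a custom header on every request.
req = attach_api_key({"url": "https://api.example.com/v1/items"},
                     "header", "X-API-Key", "sk-123")
print(req["headers"])
```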
pytorch-parallel-compiler v0.5.0 upgrades:
- Complex benchmarking for wide primitive objects is now supported, including multiple presets for quick tests on hardware.
- All supported primitives either have validity checks or will get them.
- 6 new wide layers are supported directly and will be a key part of the autotuner before v1.0.
- WideTracedModel is a preliminary auto-builder, so the user doesn't need to build models manually by gathering layers.
New layers for 0.5.0: WideGRU, WideLSTM, WideGroupNorm, WideMultiheadedAttention, WideInstancenorm1/2d, WideConv3d
Upcoming for 1.0:
- WideTracedModel fully building any supported layer patterns, with multiple autotune candidates for auto-selection.
- Module cherry-picking per use case; e.g. replacing only WideLinear when it benefits your case by 35% while wide attention would only reduce by 10%, so attention is left alone.
- All (roughly 32 more) commonly used PyTorch layer systems supported in one form or another with wide-batched kernels, benefiting both eager and compiled modes; many require reworks or complete remakes.
- Autotuning wide formats based on hardware response to the kernels: kernel chunking for big, slow processes such as LSTM; kernel fusion for small processes with excess overhead; expanding kernels with masking to fit specific hardware use cases; plus a series of smaller but important optimizations along the way.
- Full transformer and RoPE support with wide-batched optimizations throughout the structures, for more robust autoregressive throughput.
- Additional Conv1d, Conv2d, and Conv3d optimizations.
Beyond version 1.0:
- Entire diffusion structures specifically kernelized for high-efficiency utilization in eager and compiled modes.
- Video-diffusion-specific targets meant to heavily reduce computation costs on the GPU and increase computation throughput on the GPU.
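The core "wide" idea (running many parallel sibling layers as one batched kernel instead of a Python loop) can be illustrated with plain NumPy. This is a conceptual sketch of wide-batched linear layers, not the library's actual API or kernels.

```python
import numpy as np

# N independent linear layers, each (in_f -> out_f), applied to N inputs.
N, in_f, out_f, batch = 4, 8, 16, 32
rng = np.random.default_rng(0)
W = rng.standard_normal((N, out_f, in_f))   # N stacked weight matrices
b = rng.standard_normal((N, out_f))         # N stacked biases
x = rng.standard_normal((N, batch, in_f))   # one input batch per layer

# Naive version: loop over layers, one matmul each (N kernel launches).
looped = np.stack([x[i] @ W[i].T + b[i] for i in range(N)])

# Wide-batched version: a single batched einsum over all N layers.
wide = np.einsum("nbi,noi->nbo", x, W) + b[:, None, :]

print(np.allclose(looped, wide))  # both paths compute the same result
```

On a GPU, collapsing the loop into one batched kernel is what cuts launch overhead and improves utilization, which is the benefit the Wide* layers target.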
Following up on the Gitee release, here's another major Chinese code dataset from GitCode (CSDN's code hosting platform). Same pipeline, same clean format, more valuable data from China's developer ecosystem.
The final dataset in the Chinese code series is also available: nyuuzyou/jihulab-code. It's smaller in size but shares the same pipeline and formatting.
Summary: Most systems still run on "inputs → model/heuristic → single score → action". But real deployments have multiple goals plus non-negotiable constraints (safety, ethics, legal). This article is a design cookbook for migrating to goal-native control: make the goal surface explicit as a **GCS vector**, enforce **hard constraints first**, then trade off soft objectives inside the safe set.
> The primary object is a GCS vector + constraint status, not a naked scalar score.
---
Why it matters:
- Stops safety/fairness from becoming silently tradable via "mystery weights"
- Makes trade-offs auditable: "why this action now?" can be reconstructed via Effect Ledger logging
- Gives a repeatable build flow: goals → constraints → action space → GCS estimator → chooser
- Shows how to ship safely: shadow mode → thresholds → canary, with SI metrics (CAS/SCover/EAI/RIR)
---
What's inside:
- A recommended GCS convention (higher = better, scales documented, weights only for soft goals)
- Chooser patterns: lexicographic tiers, Pareto frontier, context-weighted tie-breaks
- Practical patterns: rule-based + GCS wrapper, safe bandits, planning/scheduling, RL with guardrails
- Migration path from legacy heuristics + common anti-patterns (single-scalar collapse, no ledger, no PLB/RML)
- Performance tips: pruning, caching, hybrid estimators, parallel evaluation
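The "hard constraints first, then soft trade-offs inside the safe set" chooser can be sketched as follows. The names (`gcs`, `choose`, the example goals) are illustrative assumptions, not taken from the SI-Core / GCS specs.

```python
def choose(actions, hard_constraints, soft_weights):
    # 1. Enforce hard constraints: drop any action violating one.
    #    Safety is never traded off against soft objectives.
    safe = [a for a in actions if all(c(a) for c in hard_constraints)]
    if not safe:
        return None  # no safe action: escalate rather than trade off

    # 2. Trade off soft objectives inside the safe set only,
    #    using the per-goal GCS vector (higher = better).
    def soft_score(a):
        return sum(w * a["gcs"][goal] for goal, w in soft_weights.items())
    return max(safe, key=soft_score)

actions = [
    {"name": "aggressive", "gcs": {"revenue": 0.9, "ux": 0.2}, "safe": False},
    {"name": "balanced",   "gcs": {"revenue": 0.6, "ux": 0.7}, "safe": True},
    {"name": "cautious",   "gcs": {"revenue": 0.3, "ux": 0.9}, "safe": True},
]
hard = [lambda a: a["safe"]]            # non-negotiable constraint
weights = {"revenue": 0.5, "ux": 0.5}   # weights apply to soft goals only
best = choose(actions, hard, weights)
print(best["name"])
```

Note that "aggressive" has the highest revenue score but is excluded before any weighting happens, which is exactly what prevents safety from becoming silently tradable.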
---
Structured Intelligence Engineering Series: formal contracts live in the SI-Core / GCS specs and the eval packs; this is the *how-to-design / how-to-migrate* layer.
More lightweight multimodal models are coming.
StepFun has been focused on multimodal AI from the very beginning. Their latest release is a new foundational model: STEP3-VL
https://huggingface.co/collections/stepfun-ai/step3-vl-10b
- 10B parameters, Apache 2.0
- Leads the 10B class and competes with models 10–20× larger