35 128

Charleno Pires

charleno

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory

upvoted an article 2 days ago

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

upvoted an article 2 days ago

New in llama.cpp: Anthropic Messages API

View all activity

Organizations

None yet

upvoted a paper 1 day ago

LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory

Paper • 2410.10813 • Published Oct 14, 2024 • 16

upvoted 2 articles 2 days ago

Article

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

Apr 16, 2025

•

Article

New in llama.cpp: Anthropic Messages API

Jan 19

•

upvoted a changelog 3 days ago

Hugging Face Changelog

Agent Traces on the Hub

4 days ago

• 91

upvoted an article 17 days ago

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

Jan 30, 2025

•

292

upvoted a collection about 2 months ago

GLiNER-bi-V2

Collection

4 items • Updated Jan 30 • 7

upvoted 14 articles 7 months ago

Article

Introducing AI Sheets: a tool to work with datasets using open AI models!

Aug 8, 2025

•

108

Article

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

Aug 11, 2025

•

Article

How To Build a News Agent with GPT-OSS, Hugging Face Inference & Gradio

Aug 14, 2025

•

Article

Share your open ML datasets on Hugging Face Hub!

Nov 12, 2024

•

Article

MCP for Research: How to Connect AI to Research Tools

Aug 18, 2025

•

Article

Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training

Aug 8, 2025

•

Article

Make your ZeroGPU Spaces go brrr with ahead-of-time compilation

Sep 2, 2025

•

Article

Vision Language Model Alignment in TRL ⚡️

Aug 7, 2025

•

109

Article

Implementing MCP Servers in Python: An AI Shopping Assistant with Gradio

Jul 31, 2025

•

Article

Fast LoRA inference for Flux with Diffusers and PEFT

Jul 23, 2025

•

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8, 2025

•

769

Article

Open Preference Dataset for Text-to-Image Generation by the 🤗 Community

Dec 9, 2024

•

Article

Upskill your LLMs With Gradio MCP Servers

Jul 9, 2025

•

Article

Generate Images with Claude and Hugging Face

Aug 19, 2025

•

Charleno Pires

AI & ML interests

Recent Activity

Organizations

charleno's activity

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

New in llama.cpp: Anthropic Messages API

Agent Traces on the Hub

KV Caching Explained: Optimizing Transformer Inference Efficiency

Introducing AI Sheets: a tool to work with datasets using open AI models!

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

How To Build a News Agent with GPT-OSS, Hugging Face Inference & Gradio

Share your open ML datasets on Hugging Face Hub!

MCP for Research: How to Connect AI to Research Tools

Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training

Make your ZeroGPU Spaces go brrr with ahead-of-time compilation

Vision Language Model Alignment in TRL ⚡️

Implementing MCP Servers in Python: An AI Shopping Assistant with Gradio

Fast LoRA inference for Flux with Diffusers and PEFT

SmolLM3: smol, multilingual, long-context reasoner

Open Preference Dataset for Text-to-Image Generation by the 🤗 Community

Upskill your LLMs With Gradio MCP Servers

Generate Images with Claude and Hugging Face