Ali Bidaran PRO
alibidaran
AI & ML interests
AI resercher, LLMs, Computer Vision, Generative AI, NLP, Machine /Deep learning, Reinforcement Learning
Recent Activity
updated a Space about 3 hours ago
alibidaran/MEDPAI published a Space about 3 hours ago
alibidaran/MEDPAI repliedto their post 1 day ago
๐ง Introducing Qwen3.5 โ Cognitive Reasoning Mode
I fine-tuned Qwen2.5 with GRPO to actually think before it answers โ not just pattern-match.
Most LLMs mimic reasoning. This one builds a real cognitive path:
๐ Plan โ understand the task
๐ Monitor โ reason step by step
โ
Evaluate โ verify before answering
Every response follows a strict structured protocol:
<think> <planning> ... <monitoring> ... <evaluation> ... </think>
Then a clean, reasoning-free <output>.
The model self-checks its own structure. If a section is missing or malformed โ the response is invalid.
This isn't chain-of-thought slapped on top. The reasoning protocol is baked in via RL.
๐ Full README + inference code below ๐
https://huggingface.co/alibidaran/Qwen_COG_Thinker_Merged
#AI #LLM #Qwen #ReasoningModels #GRPO #OpenSourceOrganizations
None yet