Content warning

NSFW

Erwan's Lemmy
  • Communities
  • Multi-communities
  • Support Lemmy
  • Search
  • Login
AI@lemmy.mlby☆ Yσɠƚԋσʂ ☆@lemmy.ml
4 days

Run Qwen3.6 MTP GGUFs locally ~1.4–2.2× faster with no accuracy loss and with only 18gb VRAM

huggingface.co English

The change is a result of MTP support landing in llama.cpp. The Qwen3.6 Unsloth GGUFs are now out of experimental mode, with llama.cpp has merged many PRs, and MTP is now properly supported in Unsloth.

https://unsloth.ai/docs/models/qwen3.6#mtp-guide

0
    unsloth/Qwen3.6-27B-MTP-GGUF · Hugging Face
    huggingface.co
    We’re on a journey to advance and democratize artificial intelligence through open source and open science.
    You must log in or register to comment.

    AI@lemmy.ml

    artificial_intel@lemmy.ml

    Subscribe from remote instance

    Create post

    Report community

    Modlog
    You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !artificial_intel@lemmy.ml

    Artificial intelligence (AI) is intelligence demonstrated by machines, unlike the natural intelligence displayed by humans and animals, which involves consciousness and emotionality. The distinction between the former and the latter categories is often revealed by the acronym chosen.

    Visibility: Public

    This community is visible to everyone.

    • 4 users / Day
    • 4 users / Week
    • 4 users / Month
    • 4 users / 6 months
    • 10 posts
    • 0 comments
    • 0 local subscribers
    • 6.43K subscribers
    • BE: 1.0.0-beta.0
    • Modlog
    • Instances
    • Docs
    • Code
    • join-lemmy.org