noneabove1182@sh.itjust.worksMEnglish · 1 year agoBeginner questions threadplus-squarepinmessage-squaremessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squareBeginner questions threadplus-squarepinnoneabove1182@sh.itjust.worksMEnglish · 1 year agomessage-square0fedilink
SmokeyDope@lemmy.worldEnglish · 6 hours agoReturning back to where it started with llama 3 8B. DeepHermes is a great for 8gb VRAM cardsplus-squaremessage-squaremessage-square0fedilinkarrow-up13arrow-down10
arrow-up13arrow-down1message-squareReturning back to where it started with llama 3 8B. DeepHermes is a great for 8gb VRAM cardsplus-squareSmokeyDope@lemmy.worldEnglish · 6 hours agomessage-square0fedilink
morrowind@lemm.eeEnglish · 10 hours agoSketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketchingplus-squarearxiv.orgexternal-linkmessage-square0fedilinkarrow-up13arrow-down10
arrow-up13arrow-down1external-linkSketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketchingplus-squarearxiv.orgmorrowind@lemm.eeEnglish · 10 hours agomessage-square0fedilink
hendrik@palaver.p3x.deEnglish · edit-218 hours agoRecommendations for a lightweight Python LLM framework for a webapp?plus-squaremessage-squaremessage-square4fedilinkarrow-up15arrow-down11
arrow-up14arrow-down1message-squareRecommendations for a lightweight Python LLM framework for a webapp?plus-squarehendrik@palaver.p3x.deEnglish · edit-218 hours agomessage-square4fedilink
SmokeyDope@lemmy.worldEnglish · edit-22 days agoCan Your LLM pass the Sun Theft Vibe Check (STVC) benchmark?plus-squaremessage-squaremessage-square0fedilinkarrow-up12arrow-down10
arrow-up12arrow-down1message-squareCan Your LLM pass the Sun Theft Vibe Check (STVC) benchmark?plus-squareSmokeyDope@lemmy.worldEnglish · edit-22 days agomessage-square0fedilink
SmokeyDope@lemmy.worldEnglish · edit-22 days agoDeepHermes Preview features swappable standard output to R1 distill CoT reasoning. Its kind of blowing my mind.plus-squarelemmy.worldimagemessage-square0fedilinkarrow-up12arrow-down10
arrow-up12arrow-down1imageDeepHermes Preview features swappable standard output to R1 distill CoT reasoning. Its kind of blowing my mind.plus-squarelemmy.worldSmokeyDope@lemmy.worldEnglish · edit-22 days agomessage-square0fedilink
thickertoofan@lemm.eeEnglish · 2 days agoLoaded benchmark for 1-3-4-7b models?plus-squaremessage-squaremessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squareLoaded benchmark for 1-3-4-7b models?plus-squarethickertoofan@lemm.eeEnglish · 2 days agomessage-square0fedilink
thickertoofan@lemm.eeEnglish · 4 days agoGemma 3 1B and 3B result on a "needle in a haystack" like test ran locallyplus-squaremessage-squaremessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squareGemma 3 1B and 3B result on a "needle in a haystack" like test ran locallyplus-squarethickertoofan@lemm.eeEnglish · 4 days agomessage-square0fedilink
Lantier@jlai.luEnglish · edit-24 days agoNew release: Gemma 3 family of modelsplus-squarehuggingface.coexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkNew release: Gemma 3 family of modelsplus-squarehuggingface.coLantier@jlai.luEnglish · edit-24 days agomessage-square0fedilink
Björn Tantau@swg-empire.deEnglish · edit-25 days agoIs there a German 7B Vision Model?plus-squaremessage-squaremessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squareIs there a German 7B Vision Model?plus-squareBjörn Tantau@swg-empire.deEnglish · edit-25 days agomessage-square0fedilink
morrowind@lemm.eeEnglish · 5 days agoSorting-Free GPU Kernels for LLM Samplingplus-squareflashinfer.aiexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkSorting-Free GPU Kernels for LLM Samplingplus-squareflashinfer.aimorrowind@lemm.eeEnglish · 5 days agomessage-square0fedilink
morrowind@lemm.eeEnglish · 5 days agoReka Flash, open source 21B model comparable to QWQ 32Bi.postimg.ccimagemessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1imageReka Flash, open source 21B model comparable to QWQ 32Bi.postimg.ccmorrowind@lemm.eeEnglish · 5 days agomessage-square0fedilink
Lantier@jlai.luEnglish · 12 days agoQwen/QwQ-32B · Hugging Faceplus-squarehuggingface.coexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkQwen/QwQ-32B · Hugging Faceplus-squarehuggingface.coLantier@jlai.luEnglish · 12 days agomessage-square0fedilink
Oskar@piefed.socialEnglish · 12 days agoMac Studio 2025plus-squaremessage-squaremessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squareMac Studio 2025plus-squareOskar@piefed.socialEnglish · 12 days agomessage-square0fedilink
ikt@aussie.zoneEnglish · 12 days agoNVIDIA's GeForce RTX 4090 With 96GB VRAM Reportedly Exists; The GPU May Enter Mass Production Soon, Targeting AI Workloadsplus-squarewccftech.comexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkNVIDIA's GeForce RTX 4090 With 96GB VRAM Reportedly Exists; The GPU May Enter Mass Production Soon, Targeting AI Workloadsplus-squarewccftech.comikt@aussie.zoneEnglish · 12 days agomessage-square0fedilink
morrowind@lemm.eeEnglish · 14 days agoChain of Draft: Thinking Faster by Writing Lessplus-squarearxiv.orgexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkChain of Draft: Thinking Faster by Writing Lessplus-squarearxiv.orgmorrowind@lemm.eeEnglish · 14 days agomessage-square0fedilink
morrowind@lemm.eeEnglish · 14 days agoAtom of Thoughts (AOT): lifts gpt-4o-mini to 80.6% F1 on HotpotQA, surpassing o3-mini and DeepSeek-R1plus-squarebsky.appexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkAtom of Thoughts (AOT): lifts gpt-4o-mini to 80.6% F1 on HotpotQA, surpassing o3-mini and DeepSeek-R1plus-squarebsky.appmorrowind@lemm.eeEnglish · 14 days agomessage-square0fedilink
ikt@aussie.zoneEnglish · 15 days agoCrossing the uncanny valley of conversational voiceplus-squarewww.sesame.comexternal-linkmessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkCrossing the uncanny valley of conversational voiceplus-squarewww.sesame.comikt@aussie.zoneEnglish · 15 days agomessage-square0fedilink
Smorty [she/her]@lemmy.blahaj.zoneEnglish · 17 days agoApparently microsofts new phi-4-mini is business-tuned?plus-squarelemmy.blahaj.zoneimagemessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1imageApparently microsofts new phi-4-mini is business-tuned?plus-squarelemmy.blahaj.zoneSmorty [she/her]@lemmy.blahaj.zoneEnglish · 17 days agomessage-square0fedilink
shaked_coffee@feddit.itEnglish · 19 days agoFramework just announced their new Desktop: an AI powerhorse?plus-squaremessage-squaremessage-square0fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1message-squareFramework just announced their new Desktop: an AI powerhorse?plus-squareshaked_coffee@feddit.itEnglish · 19 days agomessage-square0fedilink