hermes-agent/skills/mlops/inference/llama-cpp
Teknium 7ff7155cbd fix(skills/llama-cpp): concise description, restore python bindings, fix curl
- Description truncated to 60 chars in system prompt (extract_skill_description),
  so the 500-char HF workflow description never reached the agent; shortened to
  'llama.cpp local GGUF inference + HF Hub model discovery.' (56 chars).
- Restore llama-cpp-python section (basic, chat+stream, embeddings,
  Llama.from_pretrained) and frontmatter dependencies entry.
- Fix broken 'Authorization: Bearer ***' curl line (missing closing quote;
  llama-server doesn't require auth by default).
2026-04-21 13:30:10 -07:00
..
references improve llama.cpp skill 2026-04-21 13:30:10 -07:00
SKILL.md fix(skills/llama-cpp): concise description, restore python bindings, fix curl 2026-04-21 13:30:10 -07:00