로딩 중...
A real-time incident response record in which an ML engineer, with the help of Claude Code, discovered and disclosed a supply chain attack hidden in litellm version 1.82.8 on PyPI within 72 minutes. It demonstrates that even non-security developers can detect and report malware using AI tools.
A post sharing how to run Claude Code fully offline on a MacBook by connecting it to a local LLM without an API key or cloud, useful for developers who want to use an AI coding assistant at no cost.
A TypeScript library that combines Playwright browser automation with LLMs to reliably extract structured data from web pages, with a focus on token efficiency and JSON parsing stability.
A post about giving Claude AI access to a macOS environment, sharing real-world use cases for integrating a local computer with AI.
An analysis post arguing that the perceived sudden reduction in Claude Code limits is not an actual limit decrease, but rather a spike in token consumption driven by the 1M context window.
Claude Code 생산성을 높이는 GitHub 저장소 6개를 실제 사용해본 경험을 바탕으로 소개한 포스트로, 메모리 관리·UI 생성·워크플로우 자동화 등 실전 도구들을 한눈에 정리했다.
Google Research 2-stage vector compression — PolarQuant + QJL achieves 6x KV cache reduction with zero accuracy loss and 8x attention speedup on H100 GPUs
A project that designs a hierarchical memory structure (Cognitive Architecture) based on plain-text files to address Claude Code's inability to retain memory across sessions. A practical reference for developers who want to use AI coding assistants consistently over the long term.
A Kubernetes-based workflow automation tool where an AI agent writes code from GitHub Issues or Linear tickets, automatically fixes CI failures, incorporates review comments, and merges PRs — all without human intervention. It stands out for fully automating the entire ticket-to-PR cycle.
The Claude Code agent autonomously combined and improved existing jailbreak attack algorithms, achieving 40% ASR against GPT-OSS-Safeguard-20B and 100% ASR against Meta-SecAlign-70B.
A triple-layer security framework where an independent Watcher agent intercepts threats in real time before AI agents executing shell commands get compromised
A learning project that reimplements the core architecture of Claude Code in Swift across 9 stages to understand why it works so well, directly validating the design philosophy of 'fewer tools, trust the model more.'
A post sharing the experience that sending a short greeting like 'hey' to Claude first can consume a significant portion of your total usage limit, raising awareness about prompt-writing habits for token conservation.
Evaluates code review agents using executable tests instead of text similarity — Claude Code 32.1%, all 4 tools combined 41.5%, vs human 100%
Running GPT-4, Claude-3, and Groq simultaneously to automatically extract software requirements achieves F1 0.88 and reduces analysis time by 78%.
Large-scale study with 8 LLMs and 22,374 program variants — over 99% of LLM-generated tests remain aligned to original code patterns, degrading sharply after code changes
A Rust-based open-source project that intelligently distributes LLM models across GPU, RAM, and NVMe when they exceed your Mac's physical memory, enabling models that crash llama.cpp with OOM errors to actually run.
Google's Gemini Embedding model can now embed video directly into vectors without text transcription, enabling natural language search over dashcam footage — describe 'red truck running a stop sign' and get the clip back.
Discovers that LLM refusal behavior is dominated by a sparse set of tokens — achieves 90% attack success rate with 70% fewer queries; GPT-4o 84% ASR at 25 queries
A post about a ChatGPT prompt that discovers suitable career paths you didn't know you qualified for based on your experience and skills — a practical example of using AI for career exploration.