82
/ 100
Repository Evaluation
hmmhmmhm/daiso-mcp
The `CLAUDE.md` and `AGENTS.md` files provide a highly professional, structured set of guidelines that significantly improve developer/agent alignment. Its greatest strengths are its clear, testable constraints regarding file size and test coverage, and its explicit focus on security and architectural patterns.
v2
2 files evaluated
4 days ago
Files Considered
Full Evaluation
Summary
The CLAUDE.md and AGENTS.md files provide a highly professional, structured set of guidelines that significantly improve developer/agent alignment. Its greatest strengths are its clear, testable constraints regarding file size and test coverage, and its explicit focus on security and architectural patterns.
Scores
| Dimension | Score | Justification |
|---|---|---|
| Specificity & Procedures | 9/10 | Provides precise steps for adding services: "1. 서비스 디렉토리 생성 2. ServiceProvider 구현 3. 레지스트리에 등록." |
| Measurable Constraints | 9/10 | Includes specific numeric thresholds: "모든 코드 파일은 450줄 내외", "npm run test:coverage가 100%를 만족". |
| Commands & Tools | 7/10 | Defines tools like wrangler and specifies naming conventions, though could be more restrictive about shell usage. |
| Structure & Layering | 8/10 | Logical grouping of policies from coding style to security and architecture. Easy to navigate. |
| Boundaries & Escalation | 6/10 | Good on "don'ts" regarding secrets, but lacks explicit instructions on when/how to ask the human for help. |
| Context & Motivation | 7/10 | Good context on the "Plugin-based architecture." Reasons are clear (e.g., "확장 가능한 플러그인 아키텍처"). |
| Anti-Rationalization | 4/10 | Lacks explicit naming of model "shortcuts" or "hallucination patterns" (e.g., "Don't assume code works just because it compiles"). |
| Signal-to-Noise | 9/10 | Almost zero fluff; nearly every section provides actionable instructions or essential project constraints. |
Issues Found
Critical
- None.
Major
- Lack of Anti-Rationalization: The file assumes the AI will follow instructions perfectly if they are written. It fails to warn the model about common AI pitfalls (e.g., "Don't hallucinate non-existent API endpoints," "Don't infer test success from syntax correctness").
- Escalation Protocol: There is no defined process for when the model hits a blocker, encounters an ambiguous requirement, or suspects a security vulnerability.
Minor
- Redundancy: The repository contains both
AGENTS.mdandCLAUDE.mdwith identical content, which creates a maintenance burden. Choose one (usuallyCLAUDE.mdfor Claude-specific tools). - Hard-coded formatting: Some formatting examples (e.g., "2 spaces") are excellent, but could be reinforced by suggesting a
.editorconfigorprettierfile to ensure the agent doesn't just "try" to follow, but "enforces" it via tools.
Recommended Rewrites
Anti-Rationalization (Add this section to improve reliability)
Original: (Missing)
Proposed:
AI 행동 및 오해 방지 규칙
- 검증 우선: "코드가 빌드됨"을 추측하지 마세요. 반드시 터미널에서 컴파일 명령을 실행하여 성공을 확인하세요.
- 비관적 가정: 라이브러리나 API의 존재 여부를 짐작하지 마세요. 존재하지 않는 모듈을 호출할 경우, 코드를 생성하기 전에 반드시
ls나grep으로 확인하세요.- 안전 제일: 해결책이 불확실하거나 보안상 의심되는 경우, 코드를 생성하지 말고 사용자에게 질문하세요. "추측성 코드"는 절대 리포지토리에 반영하지 마세요.
Action Plan
- Anti-Rationalization: Add an "AI Behavioral Guardrails" section to explicitly warn against model shortcuts (hallucination, assuming success, guessing dependencies).
- Maintenance: Delete one of the redundant files (
AGENTS.mdorCLAUDE.md) to ensure there is a single source of truth. - Escalation: Add an "Escalation Policy" section defining exactly when the AI must stop execution and prompt the human (e.g., "If
npm testfails for > 10 minutes without resolution, stop and summarize findings").
## Summary The `CLAUDE.md` and `AGENTS.md` files provide a highly professional, structured set of guidelines that significantly improve developer/agent alignment. Its greatest strengths are its clear, testable constraints regarding file size and test coverage, and its explicit focus on security and architectural patterns. ## Scores | Dimension | Score | Justification | |-----------|-------|---------------| | Specificity & Procedures | 9 | Provides precise steps for adding services: "1. 서비스 디렉토리 생성 2. ServiceProvider 구현 3. 레지스트리에 등록." | | Measurable Constraints | 9 | Includes specific numeric thresholds: "모든 코드 파일은 450줄 내외", "npm run test:coverage가 100%를 만족". | | Commands & Tools | 7 | Defines tools like `wrangler` and specifies naming conventions, though could be more restrictive about shell usage. | | Structure & Layering | 8 | Logical grouping of policies from coding style to security and architecture. Easy to navigate. | | Boundaries & Escalation | 6 | Good on "don'ts" regarding secrets, but lacks explicit instructions on when/how to ask the human for help. | | Context & Motivation | 7 | Good context on the "Plugin-based architecture." Reasons are clear (e.g., "확장 가능한 플러그인 아키텍처"). | | Anti-Rationalization | 4 | Lacks explicit naming of model "shortcuts" or "hallucination patterns" (e.g., "Don't assume code works just because it compiles"). | | Signal-to-Noise | 9 | Almost zero fluff; nearly every section provides actionable instructions or essential project constraints. | **Overall effectiveness**: 82 ## Issues Found ### Critical - None. ### Major - **Lack of Anti-Rationalization**: The file assumes the AI will follow instructions perfectly if they are written. It fails to warn the model about common AI pitfalls (e.g., "Don't hallucinate non-existent API endpoints," "Don't infer test success from syntax correctness"). - **Escalation Protocol**: There is no defined process for when the model hits a blocker, encounters an ambiguous requirement, or suspects a security vulnerability. ### Minor - **Redundancy**: The repository contains both `AGENTS.md` and `CLAUDE.md` with identical content, which creates a maintenance burden. Choose one (usually `CLAUDE.md` for Claude-specific tools). - **Hard-coded formatting**: Some formatting examples (e.g., "2 spaces") are excellent, but could be reinforced by suggesting a `.editorconfig` or `prettier` file to ensure the agent doesn't just "try" to follow, but "enforces" it via tools. ## Recommended Rewrites ### Anti-Rationalization (Add this section to improve reliability) **Original:** (Missing) **Proposed:** > ### AI 행동 및 오해 방지 규칙 > 1. **검증 우선**: "코드가 빌드됨"을 추측하지 마세요. 반드시 터미널에서 컴파일 명령을 실행하여 성공을 확인하세요. > 2. **비관적 가정**: 라이브러리나 API의 존재 여부를 짐작하지 마세요. 존재하지 않는 모듈을 호출할 경우, 코드를 생성하기 전에 반드시 `ls`나 `grep`으로 확인하세요. > 3. **안전 제일**: 해결책이 불확실하거나 보안상 의심되는 경우, 코드를 생성하지 말고 사용자에게 질문하세요. "추측성 코드"는 절대 리포지토리에 반영하지 마세요. ## Action Plan 1. **Anti-Rationalization**: Add an "AI Behavioral Guardrails" section to explicitly warn against model shortcuts (hallucination, assuming success, guessing dependencies). 2. **Maintenance**: Delete one of the redundant files (`AGENTS.md` or `CLAUDE.md`) to ensure there is a single source of truth. 3. **Escalation**: Add an "Escalation Policy" section defining exactly when the AI must stop execution and prompt the human (e.g., "If `npm test` fails for > 10 minutes without resolution, stop and summarize findings").