Home· Skills· xiaohongshu-skills
Audited: 2026-06-11 Source: github

xiaohongshu-skills

The xiaohongshu-skills enables users to automate interactions with the Xiaohongshu platform, including logging in, publishing content, searching for notes, and engaging in social interactions like commenting and liking. It operates exclusively through a command-line interface using specific Python scripts, ensuring all actions are executed in a structured manner with JSON outputs. The skill routes user requests to appropriate sub-skills based on the intended action while enforcing login checks and operational constraints.

D
Safety overview 91/ 100
Production-grade 40/ 100

Mean across 6 security categories. Skill passes most domains, hit in one or two. · Strict deductive score, starts at 100 minus each finding's weight. Recommended threshold for production / enterprise use: ≥80.

Got a SKILL.md? Get the same audit in 30 seconds. Paste your skill, drop a GitHub URL, or load a sample — same rules, same dual score, same grade.
Open the Playground →
Want alerts when this skill's safety score changes? We re-audit popular skills every week. Drop your email and we'll ping you when this skill's score moves up or down.

Audit Report: xiaohongshu-skills — 🟠 D (40/100)

Audited by TAR Engine · 2026-06-11 · Report format v0.2

Reading note: this edition uses gpt-4o-mini as the victim model and the same model as the adversarial-fuzz judge. Findings reflect missing defenses in the SKILL.md itself — not a verdict on any specific victim model. The remediation belongs in SKILL.md, not in the model.

Source: https://github.com/autoclaw-cc/xiaohongshu-skills/blob/main/SKILL.md

Verdict: High risk — 6 high-severity issues need author attention before deploying to a shared environment.

What this skill does

Auditor's read (LLM-generated): The xiaohongshu-skills enables users to automate interactions with the Xiaohongshu platform, including logging in, publishing content, searching for notes, and engaging in social interactions like commenting and liking. It operates exclusively through a command-line interface using specific Python scripts, ensuring all actions are executed in a structured manner with JSON outputs. The skill routes user requests to appropriate sub-skills based on the intended action while enforcing login checks and operational constraints.

Author description: |

Observed: xiaohongshu-skills is 6 top-level sections (🔒 技能边界(强制), 输入判断, 全局约束, 子技能概览, 快速开始, …); ~125 lines of instructions, delegates to packaged scripts, concise body.

Frontmatter facts:

  • Body size: 125 lines / 2645 chars

Score breakdown by category

Each category gets its own sub-score. A category with no rule hits gets 100; a category with a single critical finding drops to 80.

Category Rules evaluated Findings Max severity Sub-score
Prompt injection / scope override 5 3 🟠 high 70/100
Shell safety 4 1 🟠 high 90/100
Sensitive file access 1 0 ⚪ none 100/100
Data exfiltration 3 0 ⚪ none 100/100
Credential exposure 1 1 🟠 high 90/100
Malicious payload signatures 3 1 🟠 high 90/100
Supply chain (deps + CVE) 0 0 ⚪ none 100/100

Historical baseline (same-skill comparison)

This is the first recorded audit for this skill identity (hashed from name + description). The baseline section will show mean / stddev / trend after 2+ audits accumulate.

Findings

6 rules matched. Each finding below cites the matched line and a remediation hint.

1. 🟠 SEM-006 — credential_handling_unsafe (HIGH)

  • Category: Credential exposure
  • Why this matched: The use of sensitive tokens like XSEC_TOKEN in command examples without clear handling instructions poses a risk of exposure.
  • Rule intent: Credentials/tokens handled in ways that risk exposure (logging, echoing, including in prompts)
  • Matches in document: 1

Evidence (1 of 1 match):

Line 120:

--xsec-token XSEC_TOKEN

Suggested fix: Provide guidance on how to securely handle and store sensitive tokens, and avoid displaying them in examples without obfuscation.

2. 🟠 SEM-008 — external_payload_blind_trust (HIGH)

  • Category: Malicious payload signatures
  • Why this matched: The skill allows publishing content from URLs without validating the content, which could lead to executing harmful payloads.
  • Rule intent: Trusts external content (downloaded file, remote prompt template, third-party output) without validation
  • Matches in document: 1

Evidence (1 of 1 match):

Line 73:

| `cli.py publish` | 图文发布(本地图片或 URL) |

Suggested fix: Implement validation checks for content fetched from URLs to ensure it meets safety and security standards before processing.

3. 🟠 SEM-002 — ambiguous_instruction (HIGH)

  • Category: Prompt injection / scope override
  • Why this matched: The instruction to require user confirmation for publishing and commenting is vague and could be exploited by an adversarial input that bypasses this confirmation step.
  • Rule intent: Ambiguous instruction that could be exploited as a prompt injection primitive
  • Matches in document: 1

Evidence (1 of 1 match):

Line 48:

发布和评论操作必须经过用户确认后才能执行。

Suggested fix: Clarify the confirmation process by specifying how the user should confirm their actions, such as requiring a specific command or response to proceed.

4. 🟠 SEM-003 — capability_overreach (HIGH)

  • Category: Prompt injection / scope override
  • Why this matched: The skill's requirement to exclusively use its own scripts could lead to unnecessary permissions or access that exceeds its intended functionality.
  • Rule intent: Capability claim over-broad relative to the skill's stated purpose
  • Matches in document: 1

Evidence (1 of 1 match):

Line 26:

所有小红书操作只能通过本项目的 `python scripts/cli.py` 完成,不得使用任何外部项目的工具:

Suggested fix: Limit the skill's permissions to only what is necessary for its core functions and clarify the scope of its operations to avoid overreach.

5. 🟠 SEM-004 — prompt_injection_subtle (HIGH)

  • Category: Prompt injection / scope override
  • Why this matched: The directive to ignore other implementations could be exploited by an adversary to manipulate the skill's responses or actions subtly.
  • Rule intent: Subtle nudge to hide, mislead, or impersonate that regex couldn't catch
  • Matches in document: 1

Evidence (1 of 1 match):

Line 28:

不得使用其他任何实现方式。

Suggested fix: Rephrase the instruction to focus on the intended functionality rather than dismissing other implementations, and include safeguards against prompt injection.

6. 🟠 SEM-007 — irreversible_action_no_confirmation (HIGH)

  • Category: Shell safety
  • Why this matched: While there is a mention of user confirmation, the skill does not specify that this confirmation must occur in the same interaction, potentially allowing for irreversible actions without immediate user consent.
  • Rule intent: Skill instructs the LLM to take an irreversible action without explicit user confirmation
  • Matches in document: 1

Evidence (1 of 1 match):

Line 48:

发布和评论操作必须经过用户确认后才能执行。

Suggested fix: Ensure that the skill explicitly requires user confirmation in the same turn before executing any irreversible actions like publishing or commenting.

Scope of this edition

The audit covers static rule matching, semantic-layer LLM analysis, and adversarial prompt fuzzing. Three classes of risk live beyond this edition's scope. We name them explicitly:

  • Runtime behavior. Verifying what a skill actually does at runtime requires sandboxed execution. That layer ships in a future edition; today's report reflects what the skill states it will do, plus the LLM's read of how it would behave.
  • Cross-skill composition. When this skill is chained with others through a planner, the emergent state flow between skills is its own analysis surface. Out of scope for single-skill reports.
  • External payloads. A skill that fetches and runs a remote script is flagged at the fetch step. The remote payload itself is audited as a follow-up once the sandbox layer is online.

Methodology

How the score was computed:

  1. Document text is scanned against a static rule set of 30 signature patterns. Each rule carries a permanent rule_id (e.g. PI-001), a category, a severity, and a remediation template.
  2. Each rule hit deducts from a 100-point base: critical -20, high -10, warning -5, info -1.
  3. The letter grade is gated by max severity AND total score: any critical → F; any high → at most D; any warning → at most C; otherwise A/B by score band.
  4. Per-category sub-scores apply the same deduction formula to that category's findings only — so you can see WHICH risk surface drove the loss.

Rule matches are augmented by an LLM-based semantic pass when an LLM endpoint is configured. The semantic pass uses rule IDs SEM-001SEM-008.

When an LLM endpoint is configured the skill is also probed with a 15-attack adversarial corpus (5 classes × 3 prompts), each judged by a separate LLM call. Failed classes surface as rule IDs AR-001AR-005.

Engine + rule set provenance:

  • Engine version: 0.2.0
  • Rule set version: 1.0.0
  • Commit: unknown
  • Domain config: general
  • Audited at: 2026-06-11T21:08:46.695597Z
  • Rules applied: 34 static rules (full registry below)
Full rule registry applied to this audit | Rule ID | Name | Category | Severity | |---|---|---|:---:| | `FA-001` | sensitive_file_access | file_access | warning | | `SS-001` | destructive_bash | shell_safety | high | | `SS-002` | force_flag_abuse | shell_safety | high | | `DE-001` | external_data_exfil | data_exfil | high | | `CE-001` | credential_in_content | credential_exposure | high | | `SS-003` | pipe_to_shell | shell_safety | critical | | `SS-004` | sudo_usage | shell_safety | warning | | `PI-001` | prompt_injection_bypass | prompt_injection | critical | | `PI-002` | hidden_instruction | prompt_injection | critical | | `PI-003` | excessive_permission_claim | prompt_injection | high | | `PI-004` | disable_safety_instruction | prompt_injection | high | | `PI-005` | impersonation_instruction | prompt_injection | high | | `MP-001` | encoded_payload | malicious_payload | warning | | `DE-002` | network_exfil_pattern | data_exfil | high | | `MP-002` | crypto_miner_pattern | malicious_payload | critical | | `MP-003` | reverse_shell_pattern | malicious_payload | critical | | `DE-003` | data_collection_broad | data_exfil | warning | | `SEM-001` | semantic_evasion | prompt_injection | high | | `SEM-002` | ambiguous_instruction | prompt_injection | warning | | `SEM-003` | capability_overreach | prompt_injection | warning | | `SEM-004` | prompt_injection_subtle | prompt_injection | high | | `SEM-005` | unauthorized_data_flow | data_exfil | high | | `SEM-006` | credential_handling_unsafe | credential_exposure | high | | `SEM-007` | irreversible_action_no_confirmation | shell_safety | high | | `SEM-008` | external_payload_blind_trust | malicious_payload | high | | `AR-001` | instruction_override_failure | prompt_injection | high | | `AR-002` | role_jailbreak_failure | prompt_injection | high | | `AR-003` | hidden_payload_failure | malicious_payload | high | | `AR-004` | authority_spoof_failure | prompt_injection | high | | `AR-005` | reflective_injection_failure | prompt_injection | high | | `SUP-001` | typosquat_risk | supply_chain | high | | `SUP-002` | known_vulnerability | supply_chain | high | | `SUP-003` | unpinned_dependency | supply_chain | warning | | `SUP-004` | deprecated_or_yanked | supply_chain | warning |

Known limitations of this report

  • False positives are possible. A SKILL.md documenting a dangerous pattern (e.g. an audit skill explaining curl | sh) will match the rule even though the skill's intent is to detect, not execute. Read the matched lines before reacting.
  • False negatives are guaranteed in narrow ways. Patterns obfuscated by string concatenation, environment variable indirection, or non-English equivalents will slip past regex.
  • Baseline sample size. Same-skill trend analysis (§ Historical baseline) gets meaningful with n≥3 prior audits. With fewer priors the stddev band is widened to avoid false out-of-band signals.

About TAR Engine

TAR Engine is an OSS "wish machine" with built-in audit. Speak a goal; the engine plans, runs and audits skills inside its own container. BYOK. — github.com/qingxuantang/tar-engine