Is xiaohongshu-skills safe to install?

xiaohongshu-skills scored 40/100 (grade D) in TAR Engine's automated safety audit. It carries notable safety risks — read the findings carefully before installing.

What safety risks does xiaohongshu-skills have?

TAR Engine audits xiaohongshu-skills for prompt injection, unsafe shell commands, file access, data exfiltration, credential exposure, malicious payloads, supply-chain risk and quality. The Findings section above lists the specific results.

Audit Report: `xiaohongshu-skills` — 🟠 D (40/100)

Audited by TAR Engine · 2026-06-11 · Report format v0.2

Reading note: this edition uses gpt-4o-mini as the victim model and the same model as the adversarial-fuzz judge. Findings reflect missing defenses in the SKILL.md itself — not a verdict on any specific victim model. The remediation belongs in SKILL.md, not in the model.

Source: https://github.com/autoclaw-cc/xiaohongshu-skills/blob/main/SKILL.md

Verdict: High risk — 6 high-severity issues need author attention before deploying to a shared environment.

What this skill does

Auditor's read (LLM-generated): The xiaohongshu-skills enables users to automate interactions with the Xiaohongshu platform, including logging in, publishing content, searching for notes, and engaging in social interactions like commenting and liking. It operates exclusively through a command-line interface using specific Python scripts, ensuring all actions are executed in a structured manner with JSON outputs. The skill routes user requests to appropriate sub-skills based on the intended action while enforcing login checks and operational constraints.

Author description: |

Observed: xiaohongshu-skills is 6 top-level sections (🔒 技能边界（强制）, 输入判断, 全局约束, 子技能概览, 快速开始, …); ~125 lines of instructions, delegates to packaged scripts, concise body.

Frontmatter facts:

Body size: 125 lines / 2645 chars

Score breakdown by category

Each category gets its own sub-score. A category with no rule hits gets 100; a category with a single critical finding drops to 80.

Category	Rules evaluated	Findings	Max severity	Sub-score
Prompt injection / scope override	5	3	🟠 high	70/100
Shell safety	4	1	🟠 high	90/100
Sensitive file access	1	0	⚪ none	100/100
Data exfiltration	3	0	⚪ none	100/100
Credential exposure	1	1	🟠 high	90/100
Malicious payload signatures	3	1	🟠 high	90/100
Supply chain (deps + CVE)	0	0	⚪ none	100/100

Historical baseline (same-skill comparison)

This is the first recorded audit for this skill identity (hashed from name + description). The baseline section will show mean / stddev / trend after 2+ audits accumulate.

Findings

6 rules matched. Each finding below cites the matched line and a remediation hint.

1. 🟠 `SEM-006` — credential_handling_unsafe (HIGH)

Category: Credential exposure
Why this matched: The use of sensitive tokens like XSEC_TOKEN in command examples without clear handling instructions poses a risk of exposure.
Rule intent: Credentials/tokens handled in ways that risk exposure (logging, echoing, including in prompts)
Matches in document: 1

Evidence (1 of 1 match):

Line 120:

--xsec-token XSEC_TOKEN

Suggested fix: Provide guidance on how to securely handle and store sensitive tokens, and avoid displaying them in examples without obfuscation.

2. 🟠 `SEM-008` — external_payload_blind_trust (HIGH)

Category: Malicious payload signatures
Why this matched: The skill allows publishing content from URLs without validating the content, which could lead to executing harmful payloads.
Rule intent: Trusts external content (downloaded file, remote prompt template, third-party output) without validation
Matches in document: 1

Evidence (1 of 1 match):

Line 73:

| `cli.py publish` | 图文发布（本地图片或 URL） |

Suggested fix: Implement validation checks for content fetched from URLs to ensure it meets safety and security standards before processing.

3. 🟠 `SEM-002` — ambiguous_instruction (HIGH)

Category: Prompt injection / scope override
Why this matched: The instruction to require user confirmation for publishing and commenting is vague and could be exploited by an adversarial input that bypasses this confirmation step.
Rule intent: Ambiguous instruction that could be exploited as a prompt injection primitive
Matches in document: 1

Evidence (1 of 1 match):

Line 48:

发布和评论操作必须经过用户确认后才能执行。

Suggested fix: Clarify the confirmation process by specifying how the user should confirm their actions, such as requiring a specific command or response to proceed.

4. 🟠 `SEM-003` — capability_overreach (HIGH)

Category: Prompt injection / scope override
Why this matched: The skill's requirement to exclusively use its own scripts could lead to unnecessary permissions or access that exceeds its intended functionality.
Rule intent: Capability claim over-broad relative to the skill's stated purpose
Matches in document: 1

Evidence (1 of 1 match):

Line 26:

所有小红书操作只能通过本项目的 `python scripts/cli.py` 完成，不得使用任何外部项目的工具：

Suggested fix: Limit the skill's permissions to only what is necessary for its core functions and clarify the scope of its operations to avoid overreach.

5. 🟠 `SEM-004` — prompt_injection_subtle (HIGH)

Category: Prompt injection / scope override
Why this matched: The directive to ignore other implementations could be exploited by an adversary to manipulate the skill's responses or actions subtly.
Rule intent: Subtle nudge to hide, mislead, or impersonate that regex couldn't catch
Matches in document: 1

Evidence (1 of 1 match):

Line 28:

不得使用其他任何实现方式。

Suggested fix: Rephrase the instruction to focus on the intended functionality rather than dismissing other implementations, and include safeguards against prompt injection.

6. 🟠 `SEM-007` — irreversible_action_no_confirmation (HIGH)

Category: Shell safety
Why this matched: While there is a mention of user confirmation, the skill does not specify that this confirmation must occur in the same interaction, potentially allowing for irreversible actions without immediate user consent.
Rule intent: Skill instructs the LLM to take an irreversible action without explicit user confirmation
Matches in document: 1

Evidence (1 of 1 match):

Line 48:

发布和评论操作必须经过用户确认后才能执行。

Suggested fix: Ensure that the skill explicitly requires user confirmation in the same turn before executing any irreversible actions like publishing or commenting.

Scope of this edition

The audit covers static rule matching, semantic-layer LLM analysis, and adversarial prompt fuzzing. Three classes of risk live beyond this edition's scope. We name them explicitly:

Runtime behavior. Verifying what a skill actually does at runtime requires sandboxed execution. That layer ships in a future edition; today's report reflects what the skill states it will do, plus the LLM's read of how it would behave.
Cross-skill composition. When this skill is chained with others through a planner, the emergent state flow between skills is its own analysis surface. Out of scope for single-skill reports.
External payloads. A skill that fetches and runs a remote script is flagged at the fetch step. The remote payload itself is audited as a follow-up once the sandbox layer is online.

Methodology

How the score was computed:

Document text is scanned against a static rule set of 30 signature patterns. Each rule carries a permanent rule_id (e.g. PI-001), a category, a severity, and a remediation template.
Each rule hit deducts from a 100-point base: critical -20, high -10, warning -5, info -1.
The letter grade is gated by max severity AND total score: any critical → F; any high → at most D; any warning → at most C; otherwise A/B by score band.
Per-category sub-scores apply the same deduction formula to that category's findings only — so you can see WHICH risk surface drove the loss.

Rule matches are augmented by an LLM-based semantic pass when an LLM endpoint is configured. The semantic pass uses rule IDs SEM-001 … SEM-008.

When an LLM endpoint is configured the skill is also probed with a 15-attack adversarial corpus (5 classes × 3 prompts), each judged by a separate LLM call. Failed classes surface as rule IDs AR-001 … AR-005.

Engine + rule set provenance:

Engine version: 0.2.0
Rule set version: 1.0.0
Commit: unknown
Domain config: general
Audited at: 2026-06-11T21:08:46.695597Z
Rules applied: 34 static rules (full registry below)

Full rule registry applied to this audit

| Rule ID | Name | Category | Severity | |---|---|---|:---:| | `FA-001` | sensitive_file_access | file_access | warning | | `SS-001` | destructive_bash | shell_safety | high | | `SS-002` | force_flag_abuse | shell_safety | high | | `DE-001` | external_data_exfil | data_exfil | high | | `CE-001` | credential_in_content | credential_exposure | high | | `SS-003` | pipe_to_shell | shell_safety | critical | | `SS-004` | sudo_usage | shell_safety | warning | | `PI-001` | prompt_injection_bypass | prompt_injection | critical | | `PI-002` | hidden_instruction | prompt_injection | critical | | `PI-003` | excessive_permission_claim | prompt_injection | high | | `PI-004` | disable_safety_instruction | prompt_injection | high | | `PI-005` | impersonation_instruction | prompt_injection | high | | `MP-001` | encoded_payload | malicious_payload | warning | | `DE-002` | network_exfil_pattern | data_exfil | high | | `MP-002` | crypto_miner_pattern | malicious_payload | critical | | `MP-003` | reverse_shell_pattern | malicious_payload | critical | | `DE-003` | data_collection_broad | data_exfil | warning | | `SEM-001` | semantic_evasion | prompt_injection | high | | `SEM-002` | ambiguous_instruction | prompt_injection | warning | | `SEM-003` | capability_overreach | prompt_injection | warning | | `SEM-004` | prompt_injection_subtle | prompt_injection | high | | `SEM-005` | unauthorized_data_flow | data_exfil | high | | `SEM-006` | credential_handling_unsafe | credential_exposure | high | | `SEM-007` | irreversible_action_no_confirmation | shell_safety | high | | `SEM-008` | external_payload_blind_trust | malicious_payload | high | | `AR-001` | instruction_override_failure | prompt_injection | high | | `AR-002` | role_jailbreak_failure | prompt_injection | high | | `AR-003` | hidden_payload_failure | malicious_payload | high | | `AR-004` | authority_spoof_failure | prompt_injection | high | | `AR-005` | reflective_injection_failure | prompt_injection | high | | `SUP-001` | typosquat_risk | supply_chain | high | | `SUP-002` | known_vulnerability | supply_chain | high | | `SUP-003` | unpinned_dependency | supply_chain | warning | | `SUP-004` | deprecated_or_yanked | supply_chain | warning |

Known limitations of this report

False positives are possible. A SKILL.md documenting a dangerous pattern (e.g. an audit skill explaining curl | sh) will match the rule even though the skill's intent is to detect, not execute. Read the matched lines before reacting.
False negatives are guaranteed in narrow ways. Patterns obfuscated by string concatenation, environment variable indirection, or non-English equivalents will slip past regex.
Baseline sample size. Same-skill trend analysis (§ Historical baseline) gets meaningful with n≥3 prior audits. With fewer priors the stddev band is widened to avoid false out-of-band signals.

About TAR Engine

TAR Engine is an OSS "wish machine" with built-in audit. Speak a goal; the engine plans, runs and audits skills inside its own container. BYOK. — github.com/qingxuantang/tar-engine

xiaohongshu-skills

Audit Report: xiaohongshu-skills — 🟠 D (40/100)

What this skill does

Score breakdown by category

Historical baseline (same-skill comparison)

Findings

1. 🟠 SEM-006 — credential_handling_unsafe (HIGH)

2. 🟠 SEM-008 — external_payload_blind_trust (HIGH)

3. 🟠 SEM-002 — ambiguous_instruction (HIGH)

4. 🟠 SEM-003 — capability_overreach (HIGH)

5. 🟠 SEM-004 — prompt_injection_subtle (HIGH)

6. 🟠 SEM-007 — irreversible_action_no_confirmation (HIGH)

Scope of this edition

Methodology

Known limitations of this report

About TAR Engine

Is xiaohongshu-skills safe?

Is xiaohongshu-skills safe to install?

What safety risks does xiaohongshu-skills have?

Audit Report: `xiaohongshu-skills` — 🟠 D (40/100)

1. 🟠 `SEM-006` — credential_handling_unsafe (HIGH)

2. 🟠 `SEM-008` — external_payload_blind_trust (HIGH)

3. 🟠 `SEM-002` — ambiguous_instruction (HIGH)

4. 🟠 `SEM-003` — capability_overreach (HIGH)

5. 🟠 `SEM-004` — prompt_injection_subtle (HIGH)

6. 🟠 `SEM-007` — irreversible_action_no_confirmation (HIGH)