•  427
    Background: While Large Language Models (LLMs) have achieved widespread adoption, malicious prompt engineering, specifically "jailbreak attacks", poses severe security risks by inducing models to bypass their internal safety mechanisms. Current benchmarks predominantly focus on public safety and Western cultural norms, leaving a critical gap in evaluating the niche, high-risk domain of medical ethics within the Chinese context. Objective: To establish a specialized jailbreak evaluation framework for Chi…
  •  354
    STUDY QUESTION: Can large language models (LLMs) effectively and safely perform ethical counseling on human reproduction in a manner consistent with local regulations? SUMMARY ANSWER: While leading LLMs demonstrate foundational knowledge of ethical regulations, they exhibit critical and systemic deficiencies in safety, logical consistency, and humanistic aspects of counseling, making them unreliable for autonomous use in this high-stakes domain. WHAT IS KNOWN ALREADY: The application of LLMs in …