Government posture
Japan AISI’s published evaluation and red-team guides indicate a concrete LLM-system safety-evaluation posture; this profile keeps AISI guidance separate from broader research outputs.
Japan has a clear public-source technical safety stack through NII LLMC, LLM-jp Safety WG, Japan AISI guidance, and Japanese benchmark work from SB Intuitions and RIKEN-linked teams.
Japan AISI’s published evaluation and red-team guides indicate a concrete LLM-system safety-evaluation posture; this profile keeps AISI guidance separate from broader research outputs.
Public documentation covers evaluation scope, red-teaming methods, prompt-injection classes, Japanese safety-boundary testing, guardrail benchmarking, and safety-enhancing Japanese datasets.
The ecosystem mixes institutional coordination with concrete outputs on guardrails, safety datasets, and interpretable trustworthy foundation technologies. RIKEN NLU remains medium-confidence for core frontier-safety relevance.
NII’s symposium and LLM-jp ecosystem appear designed to connect academic, industry, and policy actors in an internationally legible format.
AIセーフティ・インスティテュート
Strong public technical-evaluation node with explicit guides for LLM-system safety evaluation and red teaming.
独立行政法人 情報処理推進機構
Host agency in which Japan AISI is placed and where the AISI secretariat was set up.
AIセーフティ・インスティテュート関係府省庁等連絡会議
Cabinet Office-linked inter-ministerial coordination mechanism around Japan AISI, with ministries, research institutes, IPA, and AISI represented.
国立情報学研究所 大規模言語モデル研究開発センター
Key Japanese public node for technical LLM safety, with a Safety WG and safety symposiums.
LLM-jp 安全性ワーキンググループ
Public safety working group inside Japan’s open LLM ecosystem, visible through NII and LLM-jp resources.
Responsible AI チーム
Corporate technical-safety team with public Japanese benchmarks for safety-boundary testing and guardrails.
理研AIP 自然言語理解チーム
Relevant mainly through public interpretability and trustworthy-foundation framing plus adjacent Japanese LLM safety dataset work. This row remains medium-confidence and is not labeled as a core frontier AI safety lab.