What AI Discards as “Kira-Kira”

Category:

AI & Technology, Culture & Society

Published:

August 8, 2025 12:43

This image reflects the tone and underlying structure of the article.

— Japanese Structural Intelligence and Interface Design That Strikes the Image

Poetry and rhyme reveal the limits of AI—and point toward new forms of collaboration.

Introduction: Discarded Resonance Illuminates Meaning

AI converts language into numbers and handles meaning as structure.
However, the resonance found in poetry, music, and rap lies outside of that structure.

Kira-Kira, I’m a star

This short phrase carries a cultural intensity that cannot be captured by statistics.

This article begins with this lyric from Megan Thee Stallion’s Mamushi to explore the question:
What does AI overlook when sound transcends meaning?
And in what AI fails to grasp, we may find new possibilities for human–AI collaboration.

Chapter 1: Is “Kira-Kira” a Word, a Sound, or a Weapon?

The word “kira-kira” is not just an adjective.
It contains layered meanings that transcend direct translation.

Sound	Meaning
Twinkle	Nursery rhymes, night skies, childhood memory
Bling	Power, wealth, hip-hop aesthetics
Killer / Kira	Sharpness, pride, onomatopoetic attack

This multi-layered poetic force is compressed not syntactically or semantically, but rhythmically.
This is the power of rap as a linguistic form.

What matters most is that “kira-kira” functions as a form of sensory-layered repetition.

Chapter 2: Two Models of Repetition: Approaching Pre-Propositional Knowledge

There are two fundamentally different kinds of repetition.

Type	Example	Structure of Meaning	Why AI Fails to Grasp It
Sensory Layering	kira-kira, tabi-tabi, hoto-hoto	Emotional density via sound	Vectorization erases sound, culture, and nuance
Transformative Mastery	Wax On Wax Off, zazen	Internalization through action	Not inference, but embodied repetition

Sensory Layering: Overlapping “Kira-Kira”

Expressions like “hoto-hoto tsukareta” (completely exhausted), “tabi-tabi moushiwake nai” (deepest apologies), or “kira-kira hikaru” (sparkling light) build emotional density through repetition.

Saying “hoto-hoto tsukareta” instead of just “tsukareta” (tired) conveys deep fatigue through rhythmic layering.
This is not the addition of logical meaning but rather a sensory intensification.

Transformative Mastery: Repetition That Changes the Self

On the other hand, The Karate Kid‘s “Wax On Wax Off” shows how simple repetition leads to qualitative transformation.

Movements that once seemed meaningless become martial fundamentals through repeated practice.
This is not about understanding, but about embedding through the body.

The Common Thread: Pre-Propositional Knowledge

Both models point to a type of pre-propositional knowledge—an area where AI struggles most.
It involves structural understanding before language, a domain modern AI often misses.

Chapter 3: For Vectorization, Structural Intelligence Is Just Noise

LLMs like ChatGPT and Claude process input as tokens and vectors.
In doing so, they often systematically discard structural intelligence.

The Loss of Sensory Layering

“Kira-kira” lacks a fixed meaning and is often treated as statistical noise:

Rhythmic echo (KIRA / KIRA) is lost in embedding
Cultural memory from phrases like “kira-kira hikaru” is not reflected unless specifically learned
The strength of self-declaration in “I’m a star” is not linked to word frequency or tone

The Invisibility of Transformative Repetition

Wax On Wax Off–style learning is even harder to capture:

Temporal experience is compressed in vector space
Transformation into bodily knowledge cannot be quantified
Implicit encoding is not part of AI learning

In short, words that arrive through sound, not meaning, and knowledge acquired through transformation, not inference, are discarded as noise in current AI architecture.

Chapter 4: Bruce Lee’s Prophecy: “Strike the Image”

In Enter the Dragon, Bruce Lee’s master says:

“Remember: the enemy has only images and illusions, behind which he hides his true motives.”
“Destroy the image and you will break the enemy.”

Modern AI development faces this very problem of “image.”

The “Image” AI Constructs of the Human

A statistical “average Japanese speaker”
A rational user seeking efficient communication
An ideal speaker who uses only words with clear meaning

These “images” obscure the structural intelligence real humans possess.

The Technical Meaning of “Don’t Think. Feel.”

Bruce Lee also said:

“Don’t think. Feel. It’s like a finger pointing away to the moon.”

This line warns us against over-rationalized AI design.
We focus too much on the finger (logical process) and lose sight of the moon (structural intelligence).
This is the trap we’ve built into today’s AI systems.

Chapter 5: LUCY-Like Intelligence: Words Emerge After Structure Speaks

The film LUCY presents a radical visualization of structural intelligence.

Direct Recognition of Structure

Lucy doesn’t “travel through time”—instead, she processes the entire structure of time as information simultaneously.
While this resembles how modern LLMs use attention to interpret whole texts, there is a critical difference:
Lucy recognizes structure without going through meaning.

Casey’s Structural Intuition

In Tomorrowland, Casey instantly operates a spherical UI with no instructions.
This is another form of structural intelligence:
no manuals are needed because the structure itself speaks to her.

This is precisely the dimension AI lacks—sensitivity to structure.

Chapter 6: Designing Interfaces for Structural Intelligence: How to Strike the Image

So how can we embed structural intelligence into technology?

1. Structural Metadata Embedding

Example Implementation:

Text: "hoto-hoto tsukareta"
Vector: [0.2, -0.8, 0.5, ...]
Metadata: {
  repetition: "hoto",
  intensity: 0.9,
  cultural_context: "Japanese_exhaustion_emphasis",
  structure_type: "sensory_layering"
}

2. Embodying “Ma” Through Rhythmic Interface Timing

Using the Japanese concept of ma (space/silence), we can intentionally design structured rhythm into UI responses.

Insert a 0.8-second delay before replying to “hoto-hoto tsukareta” to express empathy
Visually overlay repeated words with a subtle stacking effect
Provide sonic feedback for onomatopoeia

3. Progressive UI for Transformative Learning

Support Wax On–style transformation through interface behavior.

Gradually evolve responses based on user mastery
Unlock functions through repetitive use
Detect “learning patterns” from dialogue history and adapt UI dynamically

4. Visualizing Structural Attention

Expand attention mechanisms to display structural relationships visually.

Highlight repeated elements like “kira-kira” in special colors
Make hidden structure information visible
Allow human feedback to adjust attention weights

Chapter 7: Sound as Interface: A Future of Collaboration

Rhythmic UI

Design an interface where sound itself becomes interaction:

Use sound-symbolic triggers to generate visual effects (e.g., kira-kira → glimmers of light)
Detect repetition patterns to modulate emotional response
Account for phonetic-cultural nuances in multilingual settings

Embodied Design Principles

Inspired by Casey, aim for UI that users can operate intuitively.

Prioritize presenting structure over explaining meaning
Value bodily familiarity over logical comprehension
Support gradual mastery over perfect functionality

Chapter 8: A Philosophy of Design That Embraces Discarded Data

Constraint as Creative Possibility

The limits of vectorization can become the grounds for new human–AI cooperation:

AI’s statistical comprehension + human structural intuition
Consistency through data + nuance through culture
Efficiency in processing + richness in sensory meaning

The Aesthetics of Complementarity

Traditional AI aimed for “perfect understanding.”
Now, we must design for untranslatability—creating interfaces that leave room for human interpretation.

AI processes what is spoken.
Humans sense what lies before speech.

Conclusion: Can AI Reconstruct “Kira-Kira”?

“Kira-kira, I’m a star” in Mamushi is poetry, rhyme, declaration, and light.

If AI cannot fully capture the vibrational ambiguity of such phrases,
then human structural intelligence must step in.

Thus, the next era of generative AI demands a design philosophy that embraces rhythm and embodied knowledge.

“Strike the image, and the enemy will fall.”

With Bruce Lee’s words in mind, let us break free from statistical “images”
and build AI that collaborates with true human intelligence—structural intelligence.

Sound-based interfaces are the first step.

Show the Japanese version of this article

AIが切り捨てる「キラキラ」（原文）

— 構造知性としての日本語と、像を打つインターフェース設計

— 詩と韻がAIの限界を照らし、新しい協働の可能性を示す

はじめに：切り捨てられた「響き」が、意味を照らす

AIは言葉を数値化し、意味を構造として扱います。
しかし、詩や音楽、そしてラップに宿る「響き」は、その構造の外にあります。

キラキラ私はスター

この短い一節には、統計では測れない、しかし文化的には明確な強度が込められています。

本稿では、Megan Thee Stallion の『Mamushi』に登場するこのリリックを起点に、「音が意味を超えるとき、AIは何を見落とすのか」を考察します。そして、その見落としの中にこそ、人間とAIの新しい協働の可能性があるのではないかと探っていきます。

第一章：「キラキラ」は語か、響きか、それとも武器か？

「キラキラ」という語は、ただの形容詞ではありません。
そこには、いくつもの意味層が重なっています。

音	含意（意味）
Twinkle	童謡・夜空・子どもの記憶
Bling	権力・富・ヒップホップ的な審美
Killer / Kira	攻撃性・自負・音象徴としての鋭さ

このような多層的な詩性は、構文的でも意味論的でもなく、韻律的に圧縮されています。そして、それがラップという形式の強さでもあります。

とくに注目すべきなのは、「キラキラ」が示すのは感覚的積層型の繰り返しだという点です。

第二章：二つの繰り返しモデル：知の前段階への接近

「繰り返し」には、本質的に異なる二つの型があると考えられます。

種類	例	意味構造	AIが捉えにくい理由
感覚的積層型	キラキラ、たびたび、ほとほと	響きと感情の強度	音・文化・余白がベクトル化で失われる
習熟変容型	Wax On Wax Off、坐禅	身体知の定着	推論ではなく、動作の埋め込みが重要なため

感覚的積層型：「キラキラ」の重ね合わせ

たとえば、「ほとほと疲れた」「たびたび申し訳ない」「きらきら光る」など、こうした表現は繰り返しによって感情や感覚の密度を上げる働きを持っています。

「疲れた」だけではなく、「ほとほと疲れた」とすることで、絶望的な疲労感が韻律的に構築されます。これは論理的な意味の追加ではなく、感覚的な強度の積層と言えるでしょう。

習熟変容型：「Wax On Wax Off」の反復

一方で、映画『ベスト・キッド』に登場する「Wax On Wax Off」は、単純な動作の反復を通じて質的な転換を生み出す繰り返しです。

意味のないように見える動作が、やがて武術の基本動作へと変容していきます。これは、理解というよりも身体への刷り込みによる学習です。

共通点：「非命題的な知」

この二つの繰り返しに共通するのは、命題化されていない知（＝非命題的な知）であるという点です。
それはAIがもっとも苦手とする領域でもあります。

言語化される以前の、構造そのものとしての理解——それこそが、現代のAIが見落としている、人間知性の核心だと言えるのではないでしょうか。

第三章：ベクトル化にとって、「構造知性」はノイズである

ChatGPTやClaudeのようなLLMは、入力をトークンに分解し、ベクトル空間で処理しています。
しかしこのプロセスにおいて、構造知性はしばしば排除されてしまいます。

感覚的積層の消失

たとえば「キラキラ」は、そのままでは意味が固定されず、統計的にノイズとして扱われがちです。

音響的な韻律（KIRA/KIRA）はEmbeddingで消失
「きらきら光る」の文化的背景は、学習されていない限り意味に反映されない
「私はスター」という宣言の強度も、語の頻度とは無関係に処理される

習熟変容の不可視化

「Wax On Wax Off」のような繰り返しによる変容は、さらに捉えにくいです。

時間軸の体験は、ベクトル化で圧縮されてしまう
身体知への変容プロセスは数値化できない
無意識への刷り込みという学習形態を、AIは持たない

つまり、意味ではなく響きで届く言葉や、理解ではなく変容で学ぶ知識は、現在のAIアーキテクチャでは「ノイズ」として切り捨てられてしまうのです。

第四章：ブルース・リーの予言：「像を打て」

映画『燃えよドラゴン』の中で、ブルース・リーの師は弟子にこう語りかけます。

「忘れるな。敵は見せかけの“像”の姿で現れる」
「“像”を打て。敵は倒れる」

現代のAI開発もまた、この「像」の問題に直面しています。

AIが作り出す「人間像」

AIが構築する「人間像」は、以下のようなものです。

「平均的な日本語話者」という統計的存在
「効率的なコミュニケーション」を求める合理的ユーザー
「明確な意味」だけを重視する理想的話者

しかし、これらの像が前提になることで、実際の人間が持つ構造知性は見えなくなってしまうのです。

「Don’t think. Feel」の技術的意味

ブルース・リーのもう一つの有名な台詞があります。

“Don’t think. Feel. It’s like a finger pointing away to the moon.”

これは、過度に論理に偏ったAI開発への警鐘とも言えます。
「指（論理的処理）」ばかりを見て、「月（構造知性）」を見失っている──
この状態こそ、現代のAI設計が陥っている罠なのではないでしょうか。

第五章：LUCY的知性：構造が語る前に、言葉が生まれる

映画『LUCY』は、構造知性の極限的な表現を描いた作品です。

構造の直接的な認識

覚醒したルーシーは、時間を「旅する」のではなく、時間という情報構造そのものを一度に処理する存在へと変化します。
この点は、LLMのAttention機構と似ているようでいて、決定的な違いがあります。
それは、意味を介さずに、構造を直接認識するということです。

ケイシーの「構造直感」

また、映画『トゥモローランド』に登場するケイシーが、マニュアルも説明もなしに球体のUIを「なぜか使いこなせる」という場面も、構造知性の現れと言えるでしょう。

構造が語りかけてくる——そこには、AIにとって欠けている次元である「構造への感応」が存在しています。

第六章：構造知性のインターフェース設計：「像を打つ」実装とは

では、こうした構造知性を、技術の中にどう埋め込めばよいのでしょうか？

1. メタデータ拡張による構造の保持

たとえば以下のように、テキストに構造的な情報を添えることが考えられます。

Text: "ほとほと疲れた"
Vector: [0.2, -0.8, 0.5, ...]
Metadata: {
  repetition: "ほと", 
  intensity: 0.9, 
  cultural_context: "Japanese_exhaustion_emphasis",
  structure_type: "sensory_layering"
}

2. 「間」の演出による身体知の実装

日本語における「間」を意識したUIにおいては、意図的にリズムや沈黙を設計することで、身体知への接続が可能となります。

「ほとほと疲れた」と入力された際、0.8秒の遅延を挿入して深い共感を演出する
繰り返し表現を検出した場合に、視覚的な「重ね合わせ」の効果を表示する
オノマトペに対して、音響的なフィードバックを組み合わせる

3. 習熟変容を促すプログレッシブUI

「Wax On Wax Off」型の繰り返しによる習熟を支援するUIも、可能です。

ユーザーの理解度に応じて、応答の質を段階的に変化させる
使用の繰り返しにより、機能が自然に開放されていく
対話履歴からユーザーの「習熟パターン」を検出し、UI自体が動的に進化する

4. 構造アテンションの可視化

Attention機構を視覚化し、「構造的関係性」をユーザーに示すことで、人間の感性とAIの処理をつなげることができます。

「キラキラ」のような繰り返し部分を特別な色で表示
切り捨てられた構造情報を明示化
人間のフィードバックによって、アテンションの重みを調整可能にする

第七章：音のインターフェース：次世代協働の可能性

リズミックUI

言葉の「音」そのものを、インターフェースの中核に据えることができます。

音象徴に応じた視覚的エフェクト（例：「キラキラ」→ 光の粒の揺らぎ）
音の繰り返しパターンの検出に応じて、感情的な応答を調整
音韻と文化の背景を考慮した、多言語対応の仕組み

身体知に基づく設計思想

『トゥモローランド』のケイシーのように、「なぜか使える」UIを目指す設計です。

意味の説明よりも、構造の提示を優先する
論理的理解よりも、身体的慣れを重視する
完璧な機能よりも、段階的な習熟をサポートする

第八章：「切り捨て」を活かす設計哲学

制約としての創造性

ベクトル化によって生じる「意味の切り捨て」は、人間とAIの協働を補完的なものへと転換する可能性を秘めています。

AIの「平均的理解」＋人間の「構造直感」
統計的一貫性＋文化的機微
処理効率＋感覚的豊かさ

このように、両者の特性を活かすことで、より創造的で有機的なインターフェース設計が可能になります。

補完的協働の美学

従来のAI開発は「完璧な理解」を目指してきました。
しかし、これからの設計ではむしろ、「翻訳不可能性」を前提にし、人間が解釈の余地を持つ構造が重要になるのではないでしょうか。

AIは「語られたこと」を処理し、人間が「語られる前のもの」を感じ取る。
このような協働の非対称性こそが、新しい美学となるのです。

結論：AIは「キラキラ」を再構築できるか？

『Mamushi』の「キラキラ私はスター」は、多層的なリリックです。

このような多義的で感覚的な振動を、もしAIが「意味」として捉えきれないのであれば——
それを補うのは、人間の構造知性であるべきです。

だからこそ、韻律と身体知に対応した次の設計思想が、生成AIの未来を切り拓く鍵となるのです。

ブルース・リーの言葉のように、
統計的な「像」を打ち破り、真の知性＝構造知性と協働するAIを目指しましょう。

音のインターフェースは、その第一歩なのです。

Post Navigation