Lecture 5: Perceiving Objects and Scenes

From KMU Wiki

Revision as of 11:34, 6 November 2013 (Wed)

Perceiving Objects and Scenes

Scenes

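What we see every day
Where the difficulty lies

Face recognition

Difficulty 1 (viewing angle)
A scene inside a home
- The robot's analysis and its errors

Machine vision

Top speed: 50 km/h
Average: 22 km/h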

Spirit on Mars

2004-1-4
Sashimi (sliced raw fish)
Spirit
Lander
Robotic arm
Adirondack
So named because its shape resembles the tents of the Algonquin people of North America
Get an X-ray

Optics

Image formation

Inverse projection problem

Also called inverse optics
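
The inverse projection problem can be illustrated with a small sketch: many different 3-D objects produce exactly the same 2-D image, so the image alone cannot determine the object. The pinhole model, the `project` function, and the square coordinates below are illustrative assumptions, not material from the lecture.

```python
def project(point, focal_length=1.0):
    """Pinhole projection of a 3-D point (x, y, z) onto a 2-D image plane."""
    x, y, z = point
    return (focal_length * x / z, focal_length * y / z)

# A small square close to the eye (hypothetical coordinates) ...
near_square = [(-1.0, -1.0, 2.0), (1.0, -1.0, 2.0), (1.0, 1.0, 2.0), (-1.0, 1.0, 2.0)]
# ... and a square three times larger placed three times farther away.
far_square = [(3 * x, 3 * y, 3 * z) for (x, y, z) in near_square]

near_image = [project(p) for p in near_square]
far_image = [project(p) for p in far_square]

print(near_image)
print(far_image)
print(near_image == far_image)  # True: both objects cast the same image
```

Going from the image back to the object therefore has no unique answer, which is why perception needs extra assumptions or heuristics.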

An environmental sculpture by Thomas Macaulay

Viewed from the second-floor balcony
Viewed from the first floor
Find the pencil and the glasses

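Who are these people?

Prince Charles, Woody Allen, Bill Clinton, Saddam Hussein, Richard Nixon, Princess Diana

Seen from different angles

Which photos show the same person?

Gestalt psychology

Is our perception of a face built up by accumulating these dots?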

Apparent movement

A marquee of lights running from left to right: apparent movement

Illusory contour

Good continuation (5.16)

Prägnanz

good figure / simple figure

Law of similarity

Similar stimuli tend to be perceived as belonging together
Similarity ?

proximity

Common region (Fig. 5.18b),
uniform connectedness (Fig. 5.18c),
and synchrony (Fig. 5.18d)
Natural scenes
Common fate

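Are there 12 faces in it?

Stimuli that are meaningful or familiar to the observer are perceived as a whole,

so at first glance the image looks like a single painting

What do you see?

When one of the objects is perceived as the figure, the other becomes the background
(a) How the stimuli were presented in the experiment
(b) When the dark square is seen as background, the small black dot falls on the border of the small square
(c) When it is seen as a dark square with a hole in the middle, the small black dot falls on the border of the dark square
Figure-ground separation
Background in natural scenes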

convex region

meaningfulness

face ??

Gestalt laws as heuristics

Recognition by components

Non-accidental properties

The three lines above
The two lines above
are both non-accidental properties
Biederman's experiment
Non-accidental properties
- What's this?
An airplane made of 6 geons
An airplane made of 3 geons

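Scene perception (Gist of the Scene)

Mary Potter (1976) rapidly presented 16 pictures of complex scenes, each shown for only 250 ms.
Before each sequence, observers either saw the target picture or received only a verbal cue (e.g., "a little girl clapping her hands").
Result: in both cases observers were able to detect the target.
Li Fei-Fei (2007)
Using masking, each picture was presented for 27 to 500 ms; a mask followed each picture so that the stimulus exposure time was precisely controlled.
Result: 67 ms was enough to identify some content (e.g., to see that a person was present).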

Masking

masking procedure
Cancels out persistence of vision

Global image features

Oliva and Torralba (2001, 2006)
- Degree of naturalness
- Degree of openness
- Degree of roughness
- Degree of expansion
- Color

Regularity

light-from-above heuristic

Rotation

Semantic Regularity

Hollingworth (2005)
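- Using the preceding figure as an example, there were two conditions
  - Target present
  - Target absent
- Task
  - Indicate where the target appeared
  - Indicate where the target should have appeared (in the target-absent condition)
- Results
  - Small circle: when the target was present
  - Large circle: when the target was absent
Palmer (1975)
Observers were first shown a picture of a kitchen counter (Fig. 5.39, left)
Then another set of pictures was flashed briefly (Fig. 5.39, right)
When the flashed picture was a loaf of bread, observers' accuracy was higher.
Multiple personalities of a blob: Oliva & Torralba (2007)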

The role of inference in perception

theory of unconscious inference

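When a stimulus is ambiguous, the visual system infers what it is from other available information, such as past experience. This account of perceptual reasoning is called the theory of unconscious inference.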

likelihood principle
Bayesian inference
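
As a rough illustration of how the likelihood principle can be phrased as Bayesian inference, the sketch below combines a prior (how probable each interpretation is before looking) with a likelihood (how well each interpretation explains the image) and picks the most probable interpretation. The hypotheses and all probability values are made-up numbers for illustration only; the function name `posterior` is likewise an assumption.

```python
def posterior(priors, likelihoods):
    """Bayes' rule: P(h | image) is proportional to P(image | h) * P(h)."""
    unnormalized = {h: priors[h] * likelihoods[h] for h in priors}
    total = sum(unnormalized.values())
    return {h: value / total for h, value in unnormalized.items()}

# Hypothetical example: an ambiguous shaded disc.
# Prior: past experience says light usually comes from above.
priors = {"bump lit from above": 0.8, "dent lit from below": 0.2}
# Likelihood: both interpretations explain the image equally well.
likelihoods = {"bump lit from above": 0.5, "dent lit from below": 0.5}

result = posterior(priors, likelihoods)
best = max(result, key=result.get)

print(result)
print("perceived as:", best)
```

Because the image itself is equally consistent with both interpretations in this toy case, the prior from past experience decides the percept.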

Neurons for grouping

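A neuron that responds best to a vertical line segment (a) is suppressed when that segment is placed among segments at random orientations (b), but its response is enhanced again when the segment is placed among collinear segments (c)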

Contextual modulation

Lamme (1995)
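- If the line matches the V1 cell's receptive-field preference and lies inside the "figure": the cell responds (a)
- If the line matches the V1 cell's receptive-field preference but lies outside the "figure": no response (b)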

Sensory coding

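For face perception, how do the following areas respond?

Grill-Spector (2004) experiment

Grill-Spector (2004) results
- fMRI data show that FFA activity differs not only with whether a face is seen, but also with whether the observer's response is correct

Sheinberg and Logothetis (1997)

binocular rivalry
Cells in IT respond differently depending on whether the monkey's subjective percept is the sunburst-like stimulus or the butterfly

Tong et al. (1998) experiment

When the observer perceived the non-face, PPA > FFA; when the observer perceived the face, FFA > PPA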

Voxels

Voxels (volumetric pixels)
fMRI voxels are cubes 2~3 mm on a side
Kamitani and Tong (2005): the orientation the observer is viewing can be predicted from voxel activity
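
The idea of predicting the viewed orientation from a voxel pattern can be sketched with a toy decoder. This is not the classifier used in the actual study: it is a simple nearest-centroid rule applied to simulated data, and the voxel biases, noise level, and function names are all assumptions made for illustration.

```python
import random

def simulate_pattern(orientation, rng, n_voxels=20, noise=0.5):
    """Hypothetical voxel pattern: even voxels weakly prefer 45 deg, odd voxels 135 deg."""
    pattern = []
    for v in range(n_voxels):
        bias = 1.0 if (v % 2 == 0) == (orientation == 45) else 0.0
        pattern.append(bias + rng.gauss(0.0, noise))
    return pattern

def mean_pattern(patterns):
    """Average response of each voxel across a list of patterns."""
    return [sum(column) / len(column) for column in zip(*patterns)]

def decode(pattern, centroids):
    """Return the orientation whose average training pattern is closest."""
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(centroids, key=lambda o: sq_dist(pattern, centroids[o]))

rng = random.Random(0)  # fixed seed so the simulation is repeatable
orientations = (45, 135)

# "Training": average the simulated patterns for each orientation.
centroids = {o: mean_pattern([simulate_pattern(o, rng) for _ in range(30)])
             for o in orientations}

# "Testing": decode new simulated trials and count the correct guesses.
test_trials = [(o, simulate_pattern(o, rng)) for o in orientations for _ in range(50)]
accuracy = sum(decode(p, centroids) == o for o, p in test_trials) / len(test_trials)
print(f"decoding accuracy: {accuracy:.2f}")
```

No single simulated voxel is reliable on its own; the decoder works because it pools weak biases across many voxels.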

Kamitani and Tong(2005)

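They found that brain responses can predict which of the oriented stimuli the observer is attending to

Kay et al. (2008) experiment

Presented 1,750 black-and-white photographs
Measured 500 voxels in V1
After organizing the voxels' activity
- Processed mathematically
Guessed what the observer was seeing from the voxels' activity
- Accuracy: 72~92%
Kay et al. (2008) results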

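Structural encoding

Observers viewed 1,750 images
The activity of each voxel was measured
Each voxel's response profile across the different images was used to compute that voxel's features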
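
Once each voxel's features are known, the step of guessing which image is being viewed can be sketched as follows: predict the voxel pattern each candidate image should evoke, then pick the candidate whose predicted pattern correlates best with the measured pattern. This is a simplified sketch under stated assumptions: the linear weights, the three-number image features, the candidate names, and the measured pattern are all hypothetical, and the real study used far richer models and many more voxels.

```python
def pearson(a, b):
    """Pearson correlation between two equal-length lists."""
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / (var_a * var_b) ** 0.5

def predict_pattern(image_features, voxel_weights):
    """Encoding model: each voxel's predicted response is a weighted sum of image features."""
    return [sum(w * f for w, f in zip(weights, image_features))
            for weights in voxel_weights]

def identify(measured, candidate_images, voxel_weights):
    """Pick the candidate whose predicted voxel pattern best matches the measured one."""
    return max(candidate_images,
               key=lambda name: pearson(measured,
                                        predict_pattern(candidate_images[name], voxel_weights)))

# Hypothetical per-voxel feature weights (4 voxels x 3 image features).
voxel_weights = [[1.0, 0.0, 0.5], [0.0, 1.0, 0.5], [0.5, 0.5, 0.0], [1.0, 1.0, 1.0]]
# Hypothetical feature vectors for three candidate images.
candidates = {"beach": [0.9, 0.1, 0.3], "forest": [0.1, 0.9, 0.4], "street": [0.5, 0.5, 0.9]}
# Hypothetical measured pattern: roughly the "beach" prediction plus small noise.
measured = [1.0, 0.3, 0.55, 1.25]

print(identify(measured, candidates, voxel_weights))
```

The design point is that identification never inverts the brain activity directly; it only compares the measurement against forward predictions for each candidate image.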

Are faces special?

face!!
- negative image

face in brain

Possibly related
development
development in the brain
Think about

Retrieved from "http://wiki.kmu.edu.tw/index.php/%E7%AC%AC%E4%BA%94%E8%AC%9B%EF%BC%9APerceiving_Objects_and_Scenes"

