Hi Rayhane, I'm currently looking into your LocationSensitiveAttention class and don't


A Doubt In LocationSensitiveAttention about tacotron-2 HOT 3 CLOSED

rayhane-mamah commented on May 20, 2024

A Doubt In LocationSensitiveAttention

from tacotron-2.

Comments (3)

begeekmyfriend commented on May 20, 2024

It helps convergence. You can see how the alignment improved after adding cumulative attention states in the 4th comment of this issue: keithito/tacotron#170 (comment). I have run a series of ablation studies to confirm it.


Rayhane-mamah commented on May 20, 2024

Hello @StevenZYj, thanks for reaching out!

As @begeekmyfriend said, accumulating the attention weights is a must for getting proper alignments. The paper states this as well:
"We use the location-sensitive attention from [21], which extends the additive attention mechanism [22] to use cumulative attention weights from previous decoder time steps as an additional feature. This encourages the model to move forward consistently through the input, mitigating potential failure modes where some subsequences are repeated or ignored by the decoder."

While the paper describes the entire attention mechanism in only a few words, following the references it provides gave us a sense of what the authors mean. I believe what confirms this cumulative approach is the ablation studies (by the way, great work @begeekmyfriend, you never cease to impress!): when we don't accumulate the weights, the decoder tends to repeat or ignore some subsequences.
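To make the idea concrete, here is a minimal NumPy sketch of one decoder step of additive attention with location features computed from the cumulative weights. It is an illustration only, not the repository's `LocationSensitiveAttention` class: the function names, parameter names, and sizes (`init_params`, `attention_step`, `d_attn`, `n_filters`, `kernel_size`) are assumptions made up for this example, and the weights are random rather than trained.

```python
import numpy as np


def softmax(x):
    # Numerically stable softmax over a 1-D array.
    e = np.exp(x - np.max(x))
    return e / e.sum()


def init_params(rng, d_query, d_memory, d_attn=8, n_filters=4, kernel_size=3):
    # Hypothetical random parameters; a real model would learn these.
    return {
        "W_q": rng.normal(size=(d_query, d_attn)),     # projects the decoder query
        "W_m": rng.normal(size=(d_memory, d_attn)),    # projects the encoder outputs
        "W_f": rng.normal(size=(n_filters, d_attn)),   # projects the location features
        "filters": rng.normal(size=(n_filters, kernel_size)),
        "b": np.zeros(d_attn),
        "v": rng.normal(size=d_attn),
    }


def attention_step(query, memory, cum_weights, params):
    """One attention step.

    query:       (d_query,)     current decoder state
    memory:      (T, d_memory)  encoder outputs
    cum_weights: (T,)           sum of attention weights from all previous steps
    """
    # Location features: each filter slides over the cumulative weights.
    # np.convolve flips the kernel, which is irrelevant for random filters.
    f = np.stack(
        [np.convolve(cum_weights, k, mode="same") for k in params["filters"]],
        axis=1,
    )  # (T, n_filters)

    # Additive energies with the extra location term f @ W_f.
    hidden = np.tanh(
        query @ params["W_q"] + memory @ params["W_m"] + f @ params["W_f"] + params["b"]
    )  # (T, d_attn)
    weights = softmax(hidden @ params["v"])  # (T,)
    context = weights @ memory               # (d_memory,)

    # Accumulate the weights so the next step sees where attention has been.
    return context, weights, cum_weights + weights


rng = np.random.default_rng(0)
params = init_params(rng, d_query=5, d_memory=7)
memory = rng.normal(size=(6, 7))
cum_weights = np.zeros(6)
for _ in range(3):
    query = rng.normal(size=5)
    context, weights, cum_weights = attention_step(query, memory, cum_weights, params)
```

Dropping the `cum_weights + weights` update (for example, passing only the previous step's weights) is the variant the ablation compares against; in this sketch that would be a one-line change in the return statement.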


StevenZYj commented on May 20, 2024

@begeekmyfriend @Rayhane-mamah Thanks a lot! My bad, I missed that this point is actually stated in the Tacotron 2 paper.

