This paper proposes Randomized Path-Integration (RPI), which generates diverse attribution maps, enabling the selection of the most effective one.
This work makes two key contributions:
- It employs a Gaussian diffusion process to generate a set of baselines, aiding in the identification of the most effective attribution map.
- Instead of integrating over the inputs, it integrates over the attention scores.
The computation of RPI proceeds in three steps.
Step 1: Generating a set of baselines via a Gaussian diffusion process.
$$ \mathcal{B}_t=\mathcal{N}\left(\sqrt{\bar{\alpha}_t} \mathbf{a}_u^l,\left(1-\bar{\alpha}_t\right) \mathbf{I}\right), \quad \bar{\alpha}_t=\prod_{j=1}^t \alpha_j $$
Step 2: Generating interpolated points.
$$ \mathbf{v}^{lr} = \mathbf{b}^{lr} + a (\mathbf{a}^{lr} - \mathbf{b}^{lr}), \quad \mathbf{b}^{lr} \in \mathcal{B}_t $$
Step 3: Applying the Riemann sum to approximate the integral.
$$ \mathbf{m}^{lr} = \phi\left(\frac{\mathbf{a}^{l} - \mathbf{b}^{lr}}{n} \cdot \sum^{n}_{j=1}\frac{\partial F_{y}}{\partial \mathbf{v}^{lr}} \cdot \mathbf{v}^{lr}\right) $$
where $\mathbf{m}^{lr}$ is the resulting explanation map.
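The three steps above can be sketched in NumPy as a sanity check. This is a minimal illustration, not the authors' implementation: the toy gradient function, the number of interpolation steps, and the choice of $\phi$ as a ReLU are all assumptions.

```python
import numpy as np

def diffused_baselines(a, alpha_bar_t, k, rng):
    """Step 1: draw k baselines from N(sqrt(abar_t) * a, (1 - abar_t) * I)."""
    noise = rng.standard_normal((k,) + a.shape)
    return np.sqrt(alpha_bar_t) * a + np.sqrt(1.0 - alpha_bar_t) * noise

def rpi_attribution(a, b, grad_fn, n=32):
    """Steps 2-3: interpolate between baseline b and attention map a,
    then approximate the path integral with a Riemann sum."""
    total = np.zeros_like(a)
    for j in range(1, n + 1):
        v = b + (j / n) * (a - b)   # Step 2: interpolated point
        total += grad_fn(v) * v     # gradient of F_y at v, times v
    m = (a - b) / n * total         # Step 3: Riemann-sum scaling
    return np.maximum(m, 0.0)       # phi chosen as ReLU (assumption)

# Toy target: F_y(v) = sum(v**2), so grad F_y = 2 * v.
rng = np.random.default_rng(0)
a = rng.standard_normal((4, 4))                   # stand-in attention scores
for b in diffused_baselines(a, alpha_bar_t=0.5, k=3, rng=rng):
    m = rpi_attribution(a, b, grad_fn=lambda v: 2.0 * v)
```

Each baseline drawn in Step 1 yields one candidate map $\mathbf{m}^{lr}$, from which the most effective one is selected.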
Code
Source code: https://github.com/rpiconf/rpi
In their explanation model, they introduce a parameter `rpi_attn_prob` in the `forward` function to incorporate external attention scores.
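A hypothetical sketch of what such a hook might look like (this is not the repository's actual code; only the parameter name `rpi_attn_prob` comes from the repo):

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attention_forward(q, k, v, rpi_attn_prob=None):
    """Toy single-head attention forward pass. When rpi_attn_prob is
    supplied, it overrides the internally computed attention
    probabilities, so interpolated scores can be injected from outside."""
    scores = q @ k.T / np.sqrt(q.shape[-1])
    attn = softmax(scores) if rpi_attn_prob is None else rpi_attn_prob
    return attn @ v, attn

rng = np.random.default_rng(0)
q, k, v = (rng.standard_normal((3, 4)) for _ in range(3))
out, attn = attention_forward(q, k, v)                    # normal pass
ext = np.full((3, 3), 1.0 / 3.0)                          # injected scores
out_ext, attn_ext = attention_forward(q, k, v, rpi_attn_prob=ext)
```

This override pattern is what lets RPI feed the interpolated points $\mathbf{v}^{lr}$ through the model in place of the original attention probabilities.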
Additionally, I examined the code implementing the Riemann sum and could not find any snippet computing $\frac{\mathbf{a}^{l} - \mathbf{b}^{lr}}{n}$.
The current code also fails to run: `run_bert.py` reports a missing metrics-related parameter.
Reference
Improving LLM Attributions with Randomized Path-Integration. Findings of EMNLP 2024.