# Tensor Decomposition for Salient Object Detection in Images

## Abstract

## 1. Introduction

## 2. Related Works

**F** is decomposed into the sum of a low-rank matrix $L\in \Re^{N\times D}$ and a sparse matrix $S\in \Re^{N\times D}$ by solving the optimization problem in Equation (1). $\|\cdot\|_{*}$ stands for the nuclear norm, which computes the sum of the singular values of a matrix and is a convex relaxation of the matrix rank function. $\|\cdot\|_{1}$ represents the $l_{1}$-norm, which promotes sparsity. λ is a constant that balances the tradeoff between the low-rank term and the sparse term.
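For reference, a problem of the kind described here is commonly written as follows. This is a sketch of the standard low-rank plus sparse form built from the symbols defined above; the equality constraint is an assumption and may differ in detail from Equation (1).

$$
\min_{L,\,S}\ \|L\|_{*} + \lambda \|S\|_{1} \quad \text{s.t.} \quad \mathbf{F} = L + S
$$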

## 3. Proposed Tensor Decomposition Method for Saliency Detection

$l_{1}$-norm for the high-dimensional tensor case with order greater than three. Based on the relationship between high-dimensional tensors and matrices, we use the following tensor trace norm [25] for an n-order tensor $\mathbf{X}$:

$l_{1}$-norm:

The trace norm (or $l_{1}$-norm) of a high-dimensional tensor is a linear combination of the trace norms (or $l_{1}$-norms) of all matrices unfolded along each mode. Note that the $l_{1}$-norm definition in this article is different from the general norm definition of a tensor in [19].
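The "linear combination over mode unfoldings" can be illustrated with a short NumPy sketch. It assumes the form $\sum_i \alpha_i \|\mathbf{X}_{(i)}\|$ with non-negative weights $\alpha_i$; the symbol $\alpha_i$, the default of uniform weights, and the function names are assumptions made for illustration, not definitions taken from the text.

```python
import numpy as np


def unfold(tensor, mode):
    """Mode-`mode` unfolding: bring that axis to the front, flatten the rest."""
    return np.moveaxis(tensor, mode, 0).reshape(tensor.shape[mode], -1)


def weighted_unfolding_norm(tensor, matrix_norm, weights=None):
    """Weighted sum of a matrix norm over all mode unfoldings of `tensor`.

    `weights` plays the role of the assumed alpha_i; uniform if omitted.
    """
    n_modes = tensor.ndim
    if weights is None:
        weights = np.full(n_modes, 1.0 / n_modes)
    return float(
        sum(w * matrix_norm(unfold(tensor, m)) for m, w in enumerate(weights))
    )


def tensor_trace_norm(tensor, weights=None):
    # Nuclear norm of each unfolding = sum of its singular values.
    return weighted_unfolding_norm(
        tensor, lambda mat: np.linalg.norm(mat, ord="nuc"), weights
    )


def tensor_l1_norm(tensor, weights=None):
    # Entry-wise l1 norm of each unfolding = sum of absolute values.
    return weighted_unfolding_norm(tensor, lambda mat: np.abs(mat).sum(), weights)


if __name__ == "__main__":
    # Rank-1 third-order tensor built from three vectors.
    a, b, c = np.array([3.0, 4.0]), np.array([1.0, 0.0, 0.0]), np.array([0.0, 2.0])
    X = np.einsum("i,j,k->ijk", a, b, c)
    print(tensor_trace_norm(X), tensor_l1_norm(X))
```

Because every unfolding contains the same entries, the entry-wise $l_{1}$ term is identical for each mode, so with weights that sum to one this sketch returns the plain sum of absolute values of the tensor.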

The $l_{1}$-norm along each tube fiber, i.e., $\mathbf{S}_{ij:}$, of tensor $\mathbf{S}$ is used to measure the saliency of the corresponding segmented region in the i-th row and j-th column of the superpixels. The final saliency map is then generated accordingly and normalized to the range 0 to 1.
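A minimal sketch of this step, assuming the sparse tensor is stored as a NumPy array of shape (rows, columns, features) so that `S[i, j, :]` is the tube fiber of the superpixel at row i and column j. The min-max rescaling is one plausible way to reach the 0 to 1 range and is an assumption, as is the function name; mapping superpixel scores back to image pixels is not shown.

```python
import numpy as np


def saliency_from_sparse_tensor(S):
    """Per-superpixel saliency from the l1 norm of each tube fiber S[i, j, :]."""
    S = np.asarray(S, dtype=float)
    scores = np.abs(S).sum(axis=2)  # l1 norm along the third mode
    lo, hi = scores.min(), scores.max()
    if hi == lo:
        # Flat response: nothing stands out, so return an all-zero map.
        return np.zeros_like(scores)
    # Assumed min-max normalization to the [0, 1] range.
    return (scores - lo) / (hi - lo)
```

For example, a tensor whose only non-zero fibers are `[1, -2, 3]` and `[0, 3, 0]` yields raw scores of 6 and 3 for those two superpixels, which the rescaling maps to 1 and 0.5.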

## 4. Experimental Results

#### 4.1. Benchmark Datasets and Evaluation Criteria

$F_{\beta}$-measure. Specifically, MAE was used to measure the average pixel-wise absolute difference between the detected saliency map and the corresponding ground-truth saliency map. The OR was used to measure the importance of complete detection of salient objects. The $F_{\beta}$-measure is defined as

$$F_{\beta} = \frac{(1 + \beta^{2})\cdot P\cdot R}{\beta^{2}\cdot P + R}$$

where $\beta^{2} = 0.3$ is set to place more stress on precision than recall [33]. The weighted $F_{\beta}$-measure (WF) was also used to assess saliency detection performance. Precision (P) is the percentage of detected salient pixels that are correctly assigned. Recall (R) is the ratio of correctly detected salient pixels to the true salient pixels in the ground-truth saliency map.
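The definitions above translate directly into a few lines of NumPy. The sketch below assumes the saliency map has already been binarized for precision and recall (how the threshold is chosen is not specified here), and all function names are illustrative; OR, AUC, and the weighted variant WF are not covered.

```python
import numpy as np


def precision_recall(pred_mask, gt_mask):
    """Precision and recall of a binary saliency mask against ground truth."""
    pred = np.asarray(pred_mask, dtype=bool)
    gt = np.asarray(gt_mask, dtype=bool)
    true_pos = np.logical_and(pred, gt).sum()
    precision = true_pos / pred.sum() if pred.sum() > 0 else 0.0
    recall = true_pos / gt.sum() if gt.sum() > 0 else 0.0
    return float(precision), float(recall)


def f_beta(precision, recall, beta_sq=0.3):
    """F_beta = (1 + beta^2) * P * R / (beta^2 * P + R), with beta^2 = 0.3."""
    denom = beta_sq * precision + recall
    if denom == 0:
        return 0.0
    return (1.0 + beta_sq) * precision * recall / denom


def mean_absolute_error(saliency, gt):
    """Average pixel-wise absolute difference between two maps in [0, 1]."""
    saliency = np.asarray(saliency, dtype=float)
    gt = np.asarray(gt, dtype=float)
    return float(np.mean(np.abs(saliency - gt)))
```

With $\beta^{2} = 0.3$ the denominator weights precision less than recall, so a drop in precision lowers the score more than an equal drop in recall.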

#### 4.2. Efficiency of the Heuristic Tensor Decomposition Process

#### 4.3. Saliency Detection Performance Comparison

## 5. Discussion

## 6. Conclusions

**Figure 2.** Saliency map comparison. (**a**) Image; (**b**) tensor decomposition original (TDO) saliency map (with sub-problem solutions integrated); (**c**) ground-truth saliency map; (**d**) sub-problem 1 saliency map; (**e**) sub-problem 2 saliency map; (**f**) sub-problem 3 saliency map.

**Figure 3.** Quantitative comparison on three datasets in terms of the F-measure curve. (**a**) F-measure comparison on the iCoSeg dataset; (**b**) F-measure comparison on the ECSSD dataset; (**c**) F-measure comparison on the MSRA10K dataset.

**Figure 4.** Visual comparison of saliency maps of different methods. (**a**) Image; (**b**) ground-truth; (**c**) TDS; (**d**) TDO; (**e**) WLRR; (**f**) SMD; (**g**) SLR; (**h**) DRFI; (**i**) RBD; (**j**) HCT.

**Table 1.** Results on the iCoSeg dataset in terms of weighted $F_{\beta}$-measure (WF), mean absolute error (MAE), overlapping ratio (OR), and area under the ROC curve (AUC).

| Metric | TDS | TDO | SMD | WLRR | SLR | DRFI | RBD | HCT |
|---|---|---|---|---|---|---|---|---|
| WF↑ | 0.641 | 0.568 | 0.611 | 0.602 | 0.473 | 0.592 | 0.599 | 0.464 |
| OR↑ | 0.601 | 0.564 | 0.598 | 0.578 | 0.505 | 0.582 | 0.588 | 0.519 |
| AUC↑ | 0.839 | 0.836 | 0.822 | 0.843 | 0.805 | 0.839 | 0.827 | 0.833 |
| MAE↓ | 0.134 | 0.162 | 0.138 | 0.147 | 0.179 | 0.139 | 0.138 | 0.179 |

**Table 2.** Results on the extended complex scene saliency dataset (ECSSD) in terms of WF, MAE, OR, and AUC.

| Metric | TDS | TDO | SMD | WLRR | SLR | DRFI | RBD | HCT |
|---|---|---|---|---|---|---|---|---|
| WF↑ | 0.551 | 0.502 | 0.517 | 0.500 | 0.442 | 0.517 | 0.490 | 0.430 |
| OR↑ | 0.520 | 0.506 | 0.523 | 0.498 | 0.474 | 0.527 | 0.494 | 0.457 |
| AUC↑ | 0.815 | 0.818 | 0.775 | 0.819 | 0.764 | 0.780 | 0.752 | 0.755 |
| MAE↓ | 0.185 | 0.208 | 0.227 | 0.211 | 0.252 | 0.217 | 0.225 | 0.249 |

**Table 3.** Results on the MSRA10K dataset in terms of WF, MAE, OR, and AUC.

| Metric | TDS | TDO | SMD | WLRR | SLR | DRFI | RBD | HCT |
|---|---|---|---|---|---|---|---|---|
| WF↑ | 0.713 | 0.652 | 0.704 | 0.659 | 0.601 | 0.666 | 0.685 | 0.582 |
| OR↑ | 0.706 | 0.690 | 0.741 | 0.696 | 0.692 | 0.723 | 0.716 | 0.674 |
| AUC↑ | 0.849 | 0.849 | 0.847 | 0.853 | 0.840 | 0.857 | 0.834 | 0.847 |
| MAE↓ | 0.108 | 0.131 | 0.104 | 0.130 | 0.141 | 0.114 | 0.108 | 0.143 |

