In this paper, they systematically study different ways of learning concept bottleneck models. Let be a loss function that measures the discrepancy between the predicted and true j-th concept, and let measure the discrepancy between predicted and true targets. They consider the following ways to learn a concept bottleneck model :
-
The independent bottleneck learns and independently: , and . While is trained using the true , at test time it still takes as input.
-
The sequential bottleneck first learns in the same way as above. But it uses the concept predictions to learn .
-
The joint bottleneck minimizes the weighted sum
-
The standard model ignores concepts and directly minimizes .
The hyperparameter in the joint bottleneck controls the tradeoff between concept vs. task loss. They also consider that the standard model is equivalent to taking , while the sequential bottleneck can be viewed as taking .
References
Koh, P. W., Nguyen, T., Tang, Y. S., Mussmann, S., Pierson, E., Kim, B., & Liang, P. (2020, November). Concept bottleneck models. In International conference on machine learning (pp. 5338-5348). PMLR.