Track from experiences on audio generation based on pictures.