What Time Tells Us? An Explorative Study of Time Awareness Learned from Static Images

Key Takeaways

Time becomes visible through the illumination changes in what we see. Inspired by this, in this paper we explore the potential to learn time-of-day awareness from static images, asking: what does time tell us?

📕Dataset

Dataset Overview

We introduce the Time-Oriented Collection (TOC) dataset, which consists of 130,906 images with manually verified, reliable timestamps. This dataset enables us to analyze how time-related visual cues can be extracted from static images.

🔍What time tells us in Time-based Image Retrieval?

Time-based Image Retrieval
As an extension of the pretext task, the results show that under zero-shot nearest-neighbour retrieval, TICL features retrieve a higher percentage of images with small time errors than other features.
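Concretely, this protocol amounts to nearest-neighbour search over image embeddings, scored by how far the retrieved images' timestamps fall from the query's time of day. The sketch below is a minimal illustrative version of that setup (not the released evaluation code), assuming precomputed image embeddings and timestamps given as hours in [0, 24).

```python
# Minimal sketch of zero-shot nearest-neighbour time-based retrieval with a
# circular time-of-day error. Embeddings and timestamps are random stand-ins.
import numpy as np

def retrieve_topk(query_emb, gallery_embs, k=5):
    """Indices of the k gallery embeddings closest to the query (cosine similarity)."""
    q = query_emb / np.linalg.norm(query_emb)
    g = gallery_embs / np.linalg.norm(gallery_embs, axis=1, keepdims=True)
    sims = g @ q
    return np.argsort(-sims)[:k]

def circular_time_error(hours_a, hours_b):
    """Absolute time-of-day difference on a 24-hour circle (23:00 vs 01:00 -> 2h)."""
    diff = np.abs(hours_a - hours_b) % 24.0
    return np.minimum(diff, 24.0 - diff)

# Toy usage with random stand-ins for TICL (or any other) image features.
rng = np.random.default_rng(0)
gallery_embs = rng.normal(size=(1000, 512))
gallery_hours = rng.uniform(0, 24, size=1000)
query_emb, query_hour = rng.normal(size=512), 14.5

idx = retrieve_topk(query_emb, gallery_embs, k=5)
errors = circular_time_error(gallery_hours[idx], query_hour)
print("mean time error of top-5 (hours):", errors.mean())
```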

🏞️What time tells us in Video Scene Classification?

Video Scene Classification
Supported by the intuition that time correlates with scene context (see the t-SNE plot and text probabilities in the figure for evidence), our model's understanding of time-related visual context improves video scene classification, as we demonstrate on multiple datasets.
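As one illustration of how time-aware features could plug into scene classification, the sketch below mean-pools hypothetical per-frame TICL embeddings over a clip, concatenates them with a generic video feature, and applies a linear head; the names, dimensions, and fusion strategy are illustrative assumptions rather than the paper's exact pipeline.

```python
# Minimal sketch: fuse clip-level time-aware features with a video feature
# and classify the scene with a linear head. Dimensions are assumptions.
import torch
import torch.nn as nn

class TimeAwareSceneHead(nn.Module):
    def __init__(self, video_dim=768, ticl_dim=512, num_classes=10):
        super().__init__()
        self.classifier = nn.Linear(video_dim + ticl_dim, num_classes)

    def forward(self, video_feat, ticl_frame_feats):
        # video_feat: (B, video_dim); ticl_frame_feats: (B, T, ticl_dim)
        ticl_clip_feat = ticl_frame_feats.mean(dim=1)        # temporal mean pooling
        fused = torch.cat([video_feat, ticl_clip_feat], dim=-1)
        return self.classifier(fused)

# Toy forward pass.
head = TimeAwareSceneHead()
logits = head(torch.randn(4, 768), torch.randn(4, 16, 512))
print(logits.shape)  # torch.Size([4, 10])
```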

🌅What time tells us in Time-aware Image Editing?

Time-aware Image Editing
Our model also learns a visually grounded representation for different clock times throughout the day, so we can use this representation to guide image editing toward more plausible illumination. The experimental results in the figure span baseline editing methods built on both GAN and diffusion models.
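To make the guidance idea concrete, the sketch below defines a hypothetical time-guidance term: the cosine distance between an edited image's embedding and the embedding of a target clock time, which could be added as an extra loss to a GAN-inversion or diffusion-guidance objective. The `image_encoder` and `time_encoder` here are toy stand-ins for the learned encoders, not the released editing code.

```python
# Minimal sketch of a time-guidance loss for optimisation-based editing.
import torch
import torch.nn as nn
import torch.nn.functional as F

def time_guidance_loss(image_encoder, time_encoder, edited_image, target_hour):
    """Cosine distance between the edited image and a target time-of-day, in [0, 2]."""
    img_emb = F.normalize(image_encoder(edited_image), dim=-1)
    t = torch.full((edited_image.size(0), 1), target_hour, device=edited_image.device)
    time_emb = F.normalize(time_encoder(t), dim=-1)
    return 1.0 - (img_emb * time_emb).sum(dim=-1).mean()

# Toy usage with random stand-in encoders, just to show the interface.
image_encoder = nn.Sequential(nn.Flatten(), nn.Linear(3 * 64 * 64, 512))
time_encoder = nn.Linear(1, 512)
x_edit = torch.randn(2, 3, 64, 64, requires_grad=True)
loss = time_guidance_loss(image_encoder, time_encoder, x_edit, target_hour=19.0)
loss.backward()  # gradient w.r.t. the edited image can steer the edit toward 19:00 lighting
print(float(loss))
```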

✍Methodology on Pretext Estimation Task

Methodology Overview

Our proposed method, Time-Image Contrastive Learning (TICL), employs a cross-modal contrastive learning framework. Intuitively, time correlates with many abstract concepts that can be described in natural language; this motivated us to align CLIP image embeddings with our clock-timestamp representations, allowing the model to learn time-related patterns from rich visual-semantic features. The indirect correlations inherited from CLIP help our method outperform previous methods that take raw geolocation/date metadata (which is directly time-related!) as additional inputs.
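The sketch below illustrates this contrastive alignment under stated assumptions: clock times are encoded cyclically (sin/cos of the hour of day) and projected by a small MLP, then aligned to frozen CLIP image embeddings with a symmetric InfoNCE loss. The exact time encoder, projection heads, and hyperparameters in TICL may differ.

```python
# Minimal sketch of time-image contrastive alignment. CLIP embeddings are
# replaced by random tensors; the time encoder and loss are illustrative.
import math
import torch
import torch.nn as nn
import torch.nn.functional as F

class TimeEncoder(nn.Module):
    def __init__(self, embed_dim=512):
        super().__init__()
        self.mlp = nn.Sequential(nn.Linear(2, 256), nn.ReLU(), nn.Linear(256, embed_dim))

    def forward(self, hours):  # hours: (B,) in [0, 24)
        angle = hours / 24.0 * 2 * math.pi
        cyc = torch.stack([torch.sin(angle), torch.cos(angle)], dim=-1)  # cyclic encoding
        return self.mlp(cyc)

def symmetric_infonce(img_emb, time_emb, temperature=0.07):
    img_emb = F.normalize(img_emb, dim=-1)
    time_emb = F.normalize(time_emb, dim=-1)
    logits = img_emb @ time_emb.t() / temperature
    targets = torch.arange(img_emb.size(0), device=img_emb.device)
    return 0.5 * (F.cross_entropy(logits, targets) + F.cross_entropy(logits.t(), targets))

# Toy training step with random stand-ins for frozen CLIP image embeddings.
time_enc = TimeEncoder()
img_emb = torch.randn(32, 512)   # would come from a frozen CLIP image encoder
hours = torch.rand(32) * 24.0
loss = symmetric_infonce(img_emb, time_enc(hours))
print(float(loss))
```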

Time Estimation

🙏 Acknowledgements

This project is partially supported by the Royal Society grants (SIF\R1\231009, IES\R3\223050) and an Amazon Research Award. The computations in this research were performed using the Baskerville Tier 2 HPC service. Baskerville was funded by the EPSRC and UKRI through the World Class Labs scheme (EP\T022221\1) and the Digital Research Infrastructure programme (EP\W032244\1) and is operated by Advanced Research Computing at the University of Birmingham.

BibTeX

@misc{lin2025timetellsusexplorative,
      title={What Time Tells Us? An Explorative Study of Time Awareness Learned from Static Images},
      author={Dongheng Lin and Han Hu and Jianbo Jiao},
      year={2025},
      eprint={2503.17899},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2503.17899},
}