This study examines pre-service teachers’ vlogs as multimodal texts and explores their narrative types and meaning-making processes using a 3+1 analytical framework that integrates Systemic Functional Linguistics (SFL), visual grammar, and multimodal hermeneutics. The data consists of ten vlogs produced by pre-service teachers at a university in the Seoul metropolitan area, along with their production plans and storyboards. These were qualitatively analyzed by coded by scene, shot, and sound. The analysis identified five narrative and cohesion strategies including music, captions, editing, gaze, and place markers. Furthermore, the study revealed that participants embedded roles and values related to teaching within their personal narratives, which were metaphorically represented through the vlog structures. These findings suggest that vlogging can serve as an educational medium that holistically enhances both identity reflection and visual literacy in pre-service art teacher education.