Multi-Modal Automatic Video Chaptering using Large Language Models