00:00:00.71, the man first looks at the woman before turning his gaze toward the camera. His right hand is already on her shoulder. He then slowly moves his hand slightly to the right and gently clenches her shoulder at , and at , he blinks his eyes. Both are now looking at the camera. The lighting is natural daylight. The video starts with a close-up shot, and the camera slightly tilts upward."
Comment: This annotation is clear and objective, with detailed descriptions of the characters' appearance, positioning, and movements. It has precise timestamps for actions, and the lighting conditions and camera movement are well described.
http://ai-lumalabs-uber-labelling.s3-us-west-1.amazonaws.com/avlm_benchmark/337407-clip_00000000_1.mp4
"The video takes place in a bright green background with even lighting, capturing a woman dancing throughout. The focus is on a young woman with long, wavy, light brown hair. She wears an oversized white long-sleeve shirt and round earrings and interacts with her hair while posing. At various points in the video, she looks up and smiles. At , she raises her left hand and touches her hair with her fingers. Then, at , she raises her right hand, touches her shoulder, moves her right hand through her hair, and returns her left hand to her side at . She begins jumping four times, moving both hands up and down alternately and smiling. At , she looks directly at the camera and flips her hair upward with both hands. At , she looks up and stretches her arms above shoulder level while pucker whistling, lowering them below head level at . Her movements gently reflect her shadow on the background. The camera remains static with a medium shot angle throughout the video. The lighting conditions are artificial indoor lighting."
Comment: Clear and detailed action breakdown with accurate timestamps and no subjectivity.
Well-described lighting conditions and camera movements.
http://ai-lumalabs-uber-labelling.s3-us-west-1.amazonaws.com/avlm_benchmark/254480-clip_00000000_0.mp4
The scene is set inside a building that appears to be a doctor's office. In the background are large windows covered by translucent blinds, through which sunlight enters and illuminates the space. Beyond the windows, there are faint silhouettes of skyscrapers that are blurred out. In the middle of the windows is a pillar covered in red brick tiles. There are two female subjects present in the frame, both of whom are wearing flu masks and dark blue sanitary gloves, and have slightly tanned skin tones with dark hair. The first subject on the left has shoulder-length straight black hair and is wearing a brown button-down shirt. She is clutching a silver tablet device with a black case and a dark blue piece of paper in her right arm. Throughout the video, she talks to the subject on the right while looking at her and briefly looks away from the second subject at until without changing the direction of her head, referring to a white piece of paper by signaling at it with her hand. At to , the subject on the left gently nods her head. She also shrugs her left shoulder slightly while nodding her head at . The second subject is wearing a pastel pink button-down shirt with a white lab coat. She has curly hair styled in a single ponytail and is holding up two pieces of paper in one hand, looking at their contents throughout the video while the first subject talks. Both subjects have neutral facial expressions, which can be inferred from their eyes and eyebrows, as their faces are covered with masks. The camera is moving towards the left side of the frame at a fixed height, capturing the subjects above the waist. The scene is lit by the natural sunlight coming through the windows.
Comment: Well-structured and detailed annotation with good timestamps and camera description.
http://ai-lumalabs-uber-labelling.s3-us-west-1.amazonaws.com/avlm_benchmark/365681-clip_00000000_0.mp4
The scene is set outdoors in a forest filled with tall green trees and a pale blue twilight sky. The subject is a young white blonde woman with blue eyes who is wearing a white sleeveless dress. The scene begins with the woman facing forward and walking ahead, with the camera capturing her from the back at an angle biased toward her left side. At , the woman begins to turn back to look over her left shoulder. At , she has turned her body almost fully around to look back and slightly above her eye level, wearing an anxious expression that conveys fear. At , she begins to turn back, and by , she has completed turning her head forward again, with her hair flowing in the direction of her head movement, and she starts to pick up the pace. As she begins walking faster, her loose hair flows and bounces, reflecting her hurried manner of walking. The lighting is natural, and the camera follows the woman's movement, capturing her only from the waist up. The background is blurred with a bokeh effect.
Comment: Strong visual detail, with clear timestamps and camera movement.
http://ai-lumalabs-uber-labelling.s3-us-west-1.amazonaws.com/avlm_benchmark/131239-clip_00000000_0.mp4
The scene is set in the afternoon in a jungle, featuring a paved path slightly covered with rocks and surrounded by tall trees and short grass, with sunlight filtering through the foliage. The video begins with the subject out of frame. The subject is a thin white female with dark, short, and curly hair, running for exercise while wearing black capri pants and a light maroon long-sleeve t-shirt, along with gray sneakers and white socks. At the mark, the subject enters the frame from the left side, running in the same direction as the camera. Upon entering the frame, the subject gradually overtakes the camera, and the distance between her and the camera increases as the video progresses.
Comment: Clear and focused description of the environment and movement; timestamps are well used, and the camera movement is well described in relation to the subject.
http://ai-lumalabs-uber-labelling.s3-us-west-1.amazonaws.com/avlm_benchmark/253997-clip_00000000_0.mp4
The scene begins with a female subject who is in her early twenties, is thin, and has light skin and dark hair styled in a ponytail that reaches the length of her back. She is wearing beige trousers and a black and white Aztec diamond-patterned overshirt over a white T-shirt, along with silver drop earrings. She stands beside a river that flows parallel to the direction of the camera. On the left side of the frame is the subject, who is standing atop small grass-covered rock formations and tree roots, with foliage of varying heights in the background and a tree about 8 feet behind her. The right side of the frame is dominated by the flow of the river, which is pale blue in color and has a rock protruding from the surface. In the distance, on the ground of the right side of the frame next to the river, there is some foliage. As the video progresses, the foliage sways slightly due to the wind, along with the overshirt of the female subject. The subject displays an expression of calmness and pleasure in the clip, momentarily closing her eyes as well. At the mark, she drops her hand, which was originally over her head, and looks towards the camera. At , she raises her hand to her collarbone while tilting her head and closing her eyes. Afterwards, she lowers her hand and wraps it around her waist. During the length of the video, her right hand remains stationary and rests beside her leg. At , as the camera pans to the right, a second rock protruding from within the river comes into view. The camera is positioned at a fixed distance from the subject but occasionally pans around and changes slightly. The scene is illuminated primarily by warm sunlight, as well as by light reflected off the water.
Comment: Highly detailed and well-structured annotation, with clearly timestamped actions and accurately described camera behavior.
http://ai-lumalabs-uber-labelling.s3-us-west-1.amazonaws.com/avlm_benchmark/199735-clip_00000000_0.mp4
The scene takes place indoors in an office, featuring a white man in his forties with gray hair and a short beard. He is wearing a blue, long-sleeve button-down shirt and gray pants, and he is wearing glasses. He is seated at his gray desk, operating a desktop computer with his left hand while taking notes with his right hand. A metal table lamp, which is lit, is positioned to the right of the monitor. In the background, a glass wall consists of white beams serving as panels, through which another office, mostly obscured and illuminated with neon blue light, can be seen. A white blonde woman is seated at her desk, facing to the right of the frame. Between and , he looks back and forth between his notebook and the monitor before focusing on the computer screen. Over his right shoulder, on the glass wall in the background, is a sketch of the front of a car, while over his left shoulder is a sketch of the top view of the car from an angle. At , he stops taking notes and presses a single key on the keyboard, looking at the camera with a neutral expression. The majority of the lighting is dim and artificial, with a blue hue, likely projecting from ceiling lights. Additionally, the scene is illuminated by a table lamp which is projecting warm light onto the desk of the main subject. Further, there are neon blue light fixtures in the background that are adding illumination. The camera zooms in slowly throughout the scene.
Comment: Detailed and objective description with well-used timestamps and great descriptions of the camera and lighting conditions.
http://ai-lumalabs-uber-labelling.s3-us-west-1.amazonaws.com/avlm_benchmark/376414-clip_00000000_0.mp4
The scene is set at sunset in a forest with dense trees in the background, featuring a patch of trees on the left side of the frame that has a few bright yellow leaves. Small clouds of smoke linger behind the subject, who is a bald, slender, gray-haired monk in his 50s, wearing a deep red Buddhist robe. The monk is seated still with his eyes closed, wearing a neutral expression while meditating. He maintains this position for the duration of the clip, while the camera gradually moves closer to him as the clip progresses. The camera captures only the monk from the waist up and zooms in at an upward angle. The scene is illuminated by the warm natural sunlight from sunset through the trees and leaves in the background.
Comment: Strong environmental detail and lighting description; camera movement is clearly described.
http://ai-lumalabs-uber-labelling.s3-us-west-1.amazonaws.com/avlm_benchmark/374764-clip_00000000_0.mp4
The scene is set on a street surrounded by various buildings. The off-white building on the left side of the frame features revivalist architecture and extends into the distance, while the one on the right has modern architecture and is also off-white in color. The subject is Black, with curly hair and a high fade haircut. He is wearing a thin grey turtleneck sweater and a black backpack with yellow and black steel zippers, along with a leather strap chronograph wristwatch and a face mask. In the distance, there are modern skyscrapers primarily made of glass. The video begins with the subject looking down at the screen of his phone, which he is holding with both hands while typing. He is standing next to the upper part of the stairway leading to the subway below. There is a black sign above the staircase indicating the name of the station, i.e., 34 St-Penn Station, in white and blue font.
The middle rail of the stairway is slightly visible on the left side of the frame. At the mark, a man wearing a white and blue polo shirt and a white hat starts to walk across the bridge, on the wall of which the aforementioned sign is attached. At the mark, a steel grey hatchback drives across the road into the distance from the right side of the frame. The camera is handheld at a fixed position, capturing the subject from the waist up. In terms of lighting, the environment is naturally sunlit with overcast clouds.
Comment: Great character detail and well-timestamped actions; the camera and lighting conditions are described clearly.
http://ai-lumalabs-uber-labelling.s3-us-west-1.amazonaws.com/avlm_benchmark/378498-clip_00000000_0.mp4
The scene begins with a young Black man standing and clutching thin window bars in a dark, poorly lit room, looking through them at the outdoor environment that appears to contain trees in the distance beyond a large empty courtyard. He is standing in the right half of the frame. The subject is wearing a raglan half-sleeve T-shirt with a white torso and black sleeves, along with black true wireless earphones. He has short, curly black hair and a low fade haircut, and a neutral expression on his face as he peers through the window bars. At the mark, the camera moves forward enough to crop out the background of the interior while getting closer to the subject, who is only visible from above the shoulders, along with his forearms and hands. The camera is positioned to the left side of the subject. The scene is primarily illuminated by sunlight and the white tube light in the interior.
Comment: Well-balanced annotation, objective, with a great character description.
http://ai-lumalabs-uber-labelling.s3-us-west-1.amazonaws.com/avlm_benchmark/275926-clip_00000000_1.mp4
The video shows a person on the left side of the screen, kneeling on a wooden plank surface as he plants tubers into the dark soil.
He is wearing blue jeans, a gray knitted sweater, and black gloves. Beside his knee is a white container holding the tubers. The soil, positioned on the right side of the screen, has been dug into a trench, forming a small mound along the edge. One tuber is already planted at the far end. At , the person places a tuber into the soil. Then he moves his hand into the container, picks another one, and plants it at . He repeats this process , steadily working along the dugout space. The camera starts off still, then gradually tilts upward, capturing the scene.
Comment: Great, objective scene description with an accurate layout.
http://ai-lumalabs-uber-labelling.s3-us-west-1.amazonaws.com/avlm_benchmark/291922-clip_00000000_1.mp4
The video is set indoors in a studio, with a plain black background that has a round light source attached in the middle to cast a warm light, making the subject appear as a silhouette. From what can be inferred, the subject is wearing waist-high flare pants and a tucked-in long-sleeve shirt. The subject has short hair, and in this scene, they are performing hand combat moves using a small knife. At , the subject is facing their body towards the camera while turned to their left, wielding the knife in their right hand. They raise the knife upwards with an arm movement while throwing a punch with their other hand until the mark. Afterwards, they take a neutral stance and proceed to swing the blade over and then under their hand to their right side while executing an additional move in that direction with the blade at eye level at . At , the subject adopts another neutral stance, keeping the blade and their free hand close to their body while looking to the right. By , they proceed to throw another combination involving a low blow and another strike at eye level while facing their right side. They repeat the combination one more time, this time having their left arm resting against the side of their waist by .
The video concludes with the subject initiating the aforementioned combination again in the same direction, which they start for the last time at , this time having their left arm raised to their head in a defensive position. The camera remains static, with the subject placed in the middle of the frame the entire time. The scene is lit using a single warm elliptical light source in the background.
Comment: Well-detailed and structured annotation with well-timestamped actions and an objective tone.
http://ai-lumalabs-uber-labelling.s3-us-west-1.amazonaws.com/avlm_benchmark/285583-clip_00000000_1.mp4
A woman in her twenties with a fair skin tone and golden brown hair tied behind her head swims toward the foreground right of the frame wearing a black scuba diving suit and yellow gloves. The oxygen cylinder behind her back has a pale yellow shade. She holds a small metallic silver rectangular object between her left index finger and thumb. The flaps on her feet are also black. She is visible in the middle of the screen at in a horizontal swimming position with her body extending from the background toward the foreground right and her face turned toward the camera. Water bubbles are visible rising above her head from both sides of her face. Another person wearing the same is visible in the background. They are also swimming toward the camera. The surrounding is filled with bluish water with a rocky bottom covering most of the screen in the foreground in the bottom half and the middle ground in the top half. They gradually become blurred toward the background. The rocks in the foreground are visibly covered with thin patches of green algae. Small gray fishes swim around the woman and are visible near the top edge. As the video starts, the woman and the person in the background continue to swim and two fishes from the top swim and come near the foreground on the right side. They have small yellow tails, a white body with a black stripe on top.
The woman then starts turning her head at to look at the fishes. Her eyes then follow the movement of a fish that starts swimming toward the bottom from the front of the woman. She also drops the object from her hand at which is revealed to be attached to a black stick extending from her. She then extends her right hand to touch the fish and moves her hand near the fish such that the fish touches the back of her hand at . The woman's gaze follows her. The camera pans a bit toward the left at and then again starts panning to the right while moving a bit upward. The woman stops swimming and stays at one place from as she looks at the camera. She is very close to the camera and is visible mostly on the left half of the screen at the end at .
Comment: Well-detailed and accurate annotation with great use of timestamps on actions and a clear camera description.
http://ai-lumalabs-uber-labelling.s3-us-west-1.amazonaws.com/avlm_benchmark/208037-clip_00000000_0.mp4
The video shows an older woman in an outdoor setting surrounded by various flowers and green plants. The flowers and vases, in shades of brown, black, and gray, are lined up on the right side of the screen against a dark-framed wall. Some flowers are nestled among green plant stems, with white, pink, and a hint of purple flowers visible. Behind the woman, there’s a brown brick wall and a gray door frame on the left, leading to another area with billowing plants. The woman has short white hair and is dressed in an off-white shirt with buttons down the front, paired with blue jeans. She wears silver dangling earrings in her right ear. Initially bending over, she straightens up, at pulls away from a black vase, and at places her left hand on a small brown vase. At she touches a white flower with her left hand and another with her right, smiling as she admires them. Finally, at she stretches her right hand to caress one of the green leaves.
Comment: Well-detailed annotation with accurately timestamped actions and an objective tone.
http://ai-lumalabs-uber-labelling.s3-us-west-1.amazonaws.com/avlm_benchmark/239838-clip_00000000_0.mp4
A woman walks on a flat sandy terrain outdoors. She wears a wide-brimmed hat, sunglasses, a face mask pulled below her chin, a rust-colored jacket over a pink t-shirt, and a backpack which she is wearing on her shoulders. She also has long dark hair. The background features a desert-like surface with vehicle tire marks behind her. On the left side of the frame, a parked silver SUV with a black roof cargo bag is visible. The sun is low on the horizon behind her, creating natural lighting and casting shadows on the ground. At she starts walking forward while holding her backpack straps. At she turns her head slightly to the right while continuing to walk. She continues to turn her head until she looks almost behind her, with the sun now reflecting on her face, at . The camera remains static, capturing the subject in a wide shot for the duration of the video. The scene is illuminated by the sunlight emitted from the setting sun.
Comment: Well-detailed annotation with accurately timestamped actions and an objective tone.
http://ai-lumalabs-uber-labelling.s3-us-west-1.amazonaws.com/avlm_benchmark/155361-clip_00000000_0.mp4
The scene is set inside a gymnasium with large window panels on the upper walls, allowing ample sunlight to stream in, creating a cooler tone. Below the windows are two wall-mounted air conditioners positioned far apart, and an LED scoreboard displays a score of 12:11. The floor of the gymnasium is painted blue, with a green area in the middle. The two subjects are fencers in standard white fencing attire, engaged in a duel, each having a cable attached to the back of their attire. The video begins with the fencers standing face to face, legs wide apart, holding up their foils and looking for an opportunity to strike.
The fencer on the right, whose body is facing the camera, is advancing toward the fencer on the left, who is facing away from the camera and gently stepping back. At the mark, the fencer on the left lifts his foil, and at , both fencers lower their foils to feint at each other. The fencer on the right has advanced further to the left of the frame, while the fencer on the left continues to retreat. At , the fencer on the left raises his foil, while the fencer on the right lowers his once again. At , the fencer on the right retreats with his lowered foil and strikes forward, moving his body ahead and taking a wide step to make contact with his rival. The fencer on the left lowers his foil in an attempt to parry the attack, but ultimately fails. The camera dollies to the left of the frame at a constant speed, tracking the movement of the subjects. The environment is naturally lit due to the sunlight coming through the windows.
Comment: Well-detailed movement breakdown, with well-placed timestamps and a clearly described camera.
Bad task examples:
1. http://ai-lumalabs-uber-labelling.s3-us-west-1.amazonaws.com/avlm_benchmark/177736-clip_00000000_0.mp4
"The scene shows a fair skinned woman with blonde long hair in a brightly lit room looking down at her phone during the day. She is wearing a black sleeveless dress and holding a phone with a brown pouch on her right hand.