How We See

Visual compositional techniques in photography are not merely artistic conventions but are deeply rooted in the neurological mechanisms that govern human visual perception. Neuroscience research has revealed that many established photographic guidelines align remarkably well with how our brains process visual information. This article examines the biological underpinnings of visual perception and explains the neural basis for various compositional techniques in photography.

Fundamentals of Visual Processing

The Visual Pathway: From Eye to Brain

The human visual system transforms light energy into meaningful perceptual experiences through a sophisticated process—ironically far more akin to modern computational photography than to traditional cameras. Light enters the eye through the cornea, passes through the pupil and lens, and becomes focused on the retina. Specialised photoreceptors—rods and cones—convert light into electrical signals that travel along the optic nerve.

The optic nerves from each eye meet at the optic chiasm, where nerve fibres cross. The right visual field is represented in the left hemisphere, and vice versa. Visual information then travels primarily to the lateral geniculate nucleus (LGN) of the thalamus before proceeding to the visual cortex at the rear of the brain.

Visual processing continues through a hierarchy of specialised regions:

  • V1 (primary visual cortex): Processes basic features like orientation, motion direction, and depth

  • V2 (secondary visual cortex): Handles figure/background distinctions and illusory contours

  • V3: Processes global motion and covers larger portions of the visual field

  • V4: Plays a crucial role in colour processing and shape recognition

The Dual-Stream Processing Model

Beyond the early visual cortex, visual information follows two primary processing pathways:

  • The "What Pathway" (ventral stream) extends into the inferior temporal cortex and specialises in object recognition, form processing, and colour perception. It allows us to identify objects and recognise features.

  • The "Where Pathway" (dorsal stream) extends into the posterior parietal cortex and focuses on spatial relationships, motion perception, and visuomotor coordination. It helps us understand object position and movement in space.

This dual-stream processing enables us to simultaneously recognise what we see and understand its position and movement in space, which is essential for both navigation and artistic appreciation.

Visual Attention Mechanisms

Types of Visual Attention

Visual attention is the cognitive process that selects important information from our environment while filtering out less relevant data. This selectivity is crucial given the overwhelming amount of visual information constantly bombarding our senses.

Visual attention operates through two primary mechanisms:

  • Overt attention: Involves physical eye movements (saccades) to bring objects onto the fovea, the area of highest visual acuity

  • Covert attention: Involves mentally focusing on peripheral areas without moving the eyes

Both systems operate concurrently in everyday viewing but involve different neural pathways and can be separated experimentally.

Bottom-Up vs. Top-Down Attention

Visual attention processing occurs through two complementary systems:

  • Bottom-up attention: Driven by inherent properties of visual stimuli (brightness, movement, contrast) that automatically capture attention

  • Top-down attention: Guided by our goals, expectations, and prior knowledge

Research has shown that bottom-up attention modulates neural activity within the visual system, though this mechanism is less well understood than top-down processes. Several neuropsychiatric disorders, including schizophrenia and ADHD, have been linked to differences or deficits in bottom-up attentional processing.

The Neural Architecture of Attention

The brain's attention network involves several specialised regions working together. The parietofrontal network, which includes the lateral intraparietal area (LIP), frontal eye field (FEF), and superior colliculus, is crucial in guiding visual attention.

When these regions select a target, a set time elapses before an eye movement is triggered to the selected location. Multiple studies have observed this precise timing relationship, confirming that these areas provide the targeting data for both covert attention and overt eye movements.

The VISIT neural model of covert visual attention proposes specific roles for different brain structures:

  • Lateral geniculate nucleus (LGN), V1, and V2 serve as early feature maps

  • The pulvinar functions as a gating system

  • Superior colliculus and frontal eye fields operate as a bottom-up priority map

  • The posterior parietal cortex acts as a higher-level priority map and location of control networks

Neurochemistry of Visual Attention

Neurotransmitters—the brain's chemical messengers—significantly influence attention processes. Acetylcholine (ACh) is particularly important, with research showing that cholinergic enhancement improves attentional task performance and reduces neural signal variability.

Studies have demonstrated that:

  • Genetically reduced choline transporter (CHT) activity leads to poorer top-down attentional control

  • ACh modulates neural oscillations during attentional processing

  • In humans, cholinergic agonists enhance visual attention effects on alpha and beta brain oscillations

These findings highlight the complex chemical underpinnings of our ability to selectively attend to visual information.

Compositional Techniques and Their Neural Basis

Leading Lines: Neural Pathways of Visual Guidance

Leading lines represent one of the most powerful compositional techniques for directing viewer attention. Research using eye-tracking has demonstrated that leading lines significantly influence viewers' attention patterns, particularly when prominent subject elements are present1.

When viewing images with leading lines, observers demonstrate:

  • Greater engagement

  • Longer viewing times

  • Enhanced ratings on aesthetics and directional sense

These effects stem from our brain's natural tendency to follow lines. Pointing is a highly effective technique for directing a viewer's gaze due to humans' natural tendency to follow lines. This effect likely leverages the dorsal stream pathway, which specialises in spatial processing and perceptual guidance.

Leading lines create a visual flow that draws the eye through the image in a predictable pattern, effectively guiding the viewer's attentional sequence. When combined with a clear subject element at a strategic point along the lines, this technique becomes particularly effective, as confirmed by eye-tracking studies showing increased fixation duration on subjects when they are connected to leading lines.

The Rule of Thirds: Balancing Neural Activation

The rule of thirds divides an image into nine equal parts using two horizontal and two vertical lines, suggesting that key elements should be placed along these lines or at their intersections.

This technique creates compositions that feel naturally balanced to human perception. Research has shown that placing important objects along imagery thirds lines or around their intersections produces aesthetically pleasing photos6. The technique works by avoiding central placement, which might seem static, while also preventing the subject from being too close to the edges.

The Rule of Thirds is a composition guideline that places your subject in the left or right third of an image, leaving the other two thirds more open. This asymmetrical balance engages both the ventral stream (for object recognition) and the dorsal stream (for spatial relationships), creating a dynamic viewing experience.

Eye-tracking studies demonstrate that experts in photography are more sensitive to the rule of thirds than novices. When viewing photographs, experts spent more time focusing on the rule of thirds intersection points, while novices' eye movements scattered more broadly across the frame.

The Golden Ratio: Mathematical Harmony and Visual Processing

The golden ratio (approximately 1.618:1) has been used as a compositional tool across various art forms for centuries. Neurobiological studies suggest its appeal may stem from efficient neural processing.

Research using fMRI scans has shown that images composed following the golden ratio activate specific sets of cortical neurons as well as the insula, a structure mediating emotions. When subjects with no knowledge of art criticism were shown classical sculptures with proportions conforming to the golden ratio, these brain areas showed heightened activity compared to when viewing altered proportions.

A 2013 fMRI study demonstrated that images complying with the golden ratio elicited lower metabolic activity in the visual cortex, suggesting reduced neural strain. This indicates that such proportions may be processed more efficiently by our visual system, creating a sense of ease when viewing such compositions.

Rule of Odds: Cognitive Engagement Through Asymmetry

The rule of odds suggests that compositions featuring an odd number of elements are more engaging than those with even numbers.

This rule works because the human brain naturally likes to organise the things that we see around us. When there's an even number of items, the brain quickly pairs them off. However, with an odd number, no matter what it does, one of the items will always be the odd one out.

This creates a subtle cognitive challenge that:

  • Takes longer for the brain to process

  • Creates subliminal excitement

  • Results in more attention being paid to the photograph

This increased processing time likely involves heightened activity in regions associated with visual attention and cognitive processing, making images with odd-numbered elements more memorable and engaging.

Negative Space: Cognitive Relief and Subject Enhancement

Negative space—the empty area surrounding the main subject—plays a crucial role in directing attention and creating visual impact.

Negative space functions as breathing room for your eyes. Without adequate negative space, images become cluttered with every element in the photo screaming for the viewer's attention. This creates cognitive overload, diffusing attention rather than directing it.

The neural basis for this effect involves figure-ground segregation processes. Ample negative space enhances the subject through contrast, allowing the brain to more easily distinguish the subject and direct attention accordingly. It adds definition to your subject and is similar to a visual pause, reducing the cognitive load and allowing more focused processing of the main subject.

Research has shown that the ventral visual stream distinguishes foreground from background by analysing contrast and edge detection. This involves reciprocal interactions between the primary visual cortex (V1) and higher-order regions like the parietal lobe. By providing clear figure-ground separation, negative space facilitates this processing.

Light and Shadow: Contrast-Driven Attention

The interplay of light and shadow creates visual hierarchy and emotional resonance in photography.

Research indicates that direct light creates strong, well-defined shadows, adding drama to a scene, while diffused light produces softer shading, conveying a more gentle narrative. This manipulation of lighting directly affects neural responses, as the brain is highly sensitive to contrast.

Strong contrasts between light and shadow areas trigger pronounced neural responses in the visual cortex, directing attention to areas of high contrast. This attentional guidance occurs because our visual system evolved to detect edges and boundaries defined by luminance differences—a characteristic that photographers leverage through strategic lighting.

Additionally, the emotional impact of light and shadow compositions may involve interaction with the limbic system, as dramatic lighting can evoke emotional responses beyond pure visual processing.

Repetition: Habituation and Pattern Recognition

Repetition can both attract and modulate attention through its effects on neural processing.

Studies on repetition effects show complex interactions with attention. While massed repetition prompts a reduction in the late positive potential (LPP), emotional content continues to enhance the LPP despite repetition. This suggests that while repetition can lead to some habituation, emotionally significant repetitive elements maintain their ability to capture attention.

At the neural level, repetition involves mechanisms related to habituation and novelty detection. Neural attenuation occurs with repetition, where repeating a stimulus generally leads to a decreased response in neural activity compared to that for novel items. However, this attenuation is modulated by attention, with significant attenuation in the fMRI BOLD signal observed for the attended repeated scenes... while no attenuation was observed for ignored repeated scenes.

This complex relationship between repetition and neural response explains why repetitive elements in photography can be both calming (through predictability) and potentially less engaging if not balanced with novel elements.

Colour Processing and Emotional Response

Colour Processing in the Visual System

Colour perception begins with three types of cone cells in the retina, each sensitive to different wavelengths of light (roughly corresponding to red, green, and blue). However, our perception of colour is far more complex than simply registering these wavelengths.

The V4 area of the visual cortex plays a crucial role in colour processing, with specialised neurons that respond to specific hues regardless of illumination changes. This provides the neurological basis for colour constancy—our ability to perceive colours as relatively stable despite changes in lighting conditions.

Complementary colours (opposite on the colour wheel) create strong neural responses because they stimulate separate cone cells in the retina. This creates lateral inhibition in the retina's ganglion cells, enhancing edge contrast. The V4 area processes these contrasts, generating a perception of vibrancy.

Emotional and Psychological Effects of Colour

Colour affects not only our visual processing but also our emotional responses:

  • Warm colours (red, orange) activate the amygdala and hypothalamus, linked to arousal and attention1

  • Cool tones (blue, green) engage the default mode network, associated with calmness

PET scans reveal increased dopamine release in the ventral tegmental area when viewing harmonious colour palettes, suggesting a neurochemical basis for colour preferences.

While some colour responses are innate, the orbitofrontal cortex integrates learned preferences. For example, Western cultures often associate red with danger due to repeated exposure to traffic signals or warnings. These cultural associations create complex networks of response that interact with the basic neurological processing of colour.

Gestalt Principles and Pattern Recognition

Neurological Basis of Gestalt Perception

Gestalt psychology, developed in the early 20th century, posits that humans perceive visual elements as unified wholes rather than isolated parts. Modern neuroimaging techniques reveal how these principles correlate with specific brain functions.

Gestalt perception appears to be an efficient neural strategy—by grouping elements according to certain principles, the brain reduces the computational load required to process complex scenes. This efficiency may explain why images conforming to Gestalt principles often appear aesthetically pleasing.

Eye-tracking research has demonstrated that images exhibiting Gestalt qualities significantly affect viewing patterns and aesthetic judgments. For example, images with implied completion of forms (closure) result in fewer fixations but longer fixation duration, creating a stronger sense of beauty.

Key Gestalt Principles in Photography

Several Gestalt principles have direct applications in photographic composition:

  • Proximity: Elements that are close together are perceived as a group. Neurons in the visual cortex (occipital lobe) activate to cluster spatially close elements, and the lateral occipital complex identifies object boundaries, interpreting nearby elements as related.

  • Similarity: Objects that share visual characteristics (shape, colour, size) are seen as belonging together. Images showcasing repeated elements cause greater numbers of fixations and saccades, indicating increased complexity and visual interest.

  • Continuity: Our visual system naturally continues lines and curves beyond their endpoints. The brain follows the smoothest path when interpreting line segments, which explains why S-curves and other continuous shapes are visually satisfying in composition.

  • Closure: The brain tends to complete incomplete shapes, filling in gaps. Incomplete shapes trigger the prefrontal cortex to 'fill in' gaps, a process linked to predictive coding. The brain minimises sensory uncertainty by generating plausible completions, reducing cognitive load.

  • Figure-Ground: Our perception organises the visual field into figures (objects of focus) and ground (background). The ventral visual stream distinguishes foreground from background through contrast and edge detection, involving reciprocal interactions between the primary visual cortex and higher-order regions.

Practical Applications for Photographers

Leveraging Visual Processing in Composition

Understanding the neural mechanisms of visual perception provides photographers with scientific rationale for compositional choices:

  • High-contrast elements or golden ratio grids can direct saccadic eye movements, prioritising information uptake

  • Grouped elements and predictable patterns minimise activation in the anterior cingulate cortex, which monitors conflict resolution

  • Colour choices can modulate viewer mood by targeting specific neurotransmitter systems

Eye-tracking technology has revealed specific viewing patterns when people examine photographs:

  • Viewers tend to fixate first on areas of high contrast and visual weight

  • The pattern of fixations often follows compositional lines or implied directional cues

  • Faces and human figures typically attract immediate attention when present

Integration of Multiple Compositional Techniques

In practice, photographers often combine multiple compositional techniques to create visually compelling images. This integration creates layered attentional guidance that works across different neural processing mechanisms.

For example, an image might use leading lines to direct attention to a subject placed at a thirds intersection, surrounded by negative space, with dramatic lighting creating contrast. Each element engages different aspects of the visual attention system, creating a rich perceptual experience.

Research supports this integrated approach, showing that photographs that incorporate leading lines and subject elements tend to be more visually appealing than those not doing so; the lines and elements highlight the theme and create engaging and memorable photographic works.

Conclusion

The effectiveness of photographic compositional techniques is firmly grounded in the neural mechanisms of visual attention. Leading lines exploit our tendency to follow directional cues, the rule of thirds aligns with natural viewing patterns, negative space provides cognitive relief, and light and shadow leverage our sensitivity to contrast. Each technique engages specific aspects of our visual processing system, from oscillatory neural activity to specialised processing streams.

Understanding these neural underpinnings provides scientific insight into why traditional compositional rules work. This knowledge can inform creative decisions, deliberately engaging viewers' attention mechanisms to create more impactful images.

The relationship between composition and attention is bidirectional—compositional techniques shape attention, while understanding attention mechanisms can inspire new compositional approaches. As neuroscience advances, we may discover even more precise ways to engage visual attention through photographic composition, further bridging the gap between art and science.

Next
Next

Light & Shadows