Visual search tools are building a bridge

September 9, 2019

Retailers and brands continuously think about ways to improve the shopping experience and eliminate all possible friction before actual purchase. With visual search tools, they have a new instrument at their disposal. Similar to voice search, it eases and quickens the search process. However, as visual search tools align the unique characteristics of the physical world with the frictionless character of the digital world, they have more to offer than simply a new easy and quick way of shopping.

Our observations

  • When people undertake a visual search, they look for a product using an image or photo instead of keywords. This can be done in several ways. People can use their smartphone camera or upload a photo to a retailer. Instantly, the retailer will show the exact item for immediate purchase or similar items that match the characteristics of the product in the photo (e.g. color, style, collection, pattern, fitting, etc.). Currently, brands are primarily using it to improve product discovery and provide instant gratification.
  • Visual search has been here for some time. Google launched Google Image in 2011, but it was not until 2017 that visual search picked up pace. When Pinterest launched its visual discovery tool Lens, scanning images quickly became mainstream. It marked the beginning of a boom in visual search. After one year, Pinterest noted Lens had been used to complete over 600 million visual searches every month. In the past two years, important competitors have brought their visual search to the market or updated it. For example, the updated Google Lens is far more sophisticated than the earlier Google Image. Amazon recently implemented its new feature “style app”, letting users replicate their favorite fashion items by uploading a photo. Moreover, last year, it launched a partnership with Snapchat to compete with Pinterest and Instagram in the social commerce area. Snapchat users using the new visual search feature are directly linked to Amazon’s website.
  • There are strong signs visual search tools are here to stay. According to Marketwatch, 62% of Gen Z and millennials want visual search tools, more than any other new technology. And brands experimenting with visual search tools reported increased conversion rates.
  • In the past two decades, Google was nearly the only online entry point for textual product search. Last year became a landmark, with Amazon ending the long streak of Google’s online product search dominance. According to research, nowadays, more people in the U.S. use Amazon to start their product search. This marks another win in Amazon ecommerce dominance in the US. The competition between the two in the area of search is heading for unprecedented heights, as they’re also competitors in another important search domain: smart home speakers.

Connecting the dots

In the past, people went to a store and looked around to find new products. Maybe they would ask a store clerk, who would help them find the desired product. Visual and voice search coexisted peacefully in our average product discovery. Digitization has brought interesting changes to the way we search for products. The rise of e-commerce caused the unbundling of search and entering keywords (text search) became the new normal. Although most shopping is still done in physical locations, 87% of all product searches begin with digital channels. Conducting online pre-purchase research is common for almost all products, with groceries as the main exception. For around two decades, text search has dominated this online landscape. For the last couple of years, search behavior in retail has been slowly changing. In addition to text search, voice assistants have become a customer touchpoint for product search and product information. It started with mobile assistants such as Siri and now smart home speakers are starting to cleverly integrate the shopping experience into daily habits (e.g. purchase new groceries while cooking, order new socks while dressing). Compared to text search, we interact with voice search engines in a more conversational way. In this regard, they seem to function as a sort of virtual store clerk (e.g. asking if store X has a bigger size, whether there are discounts or what the openings hours are, etc.). Despite these differences between text and voice search, the underlying principle is the same: language.
In the past two years, visual search has slowly entered the stage. In a certain way, we could understand visual search as an extension of text search and voice search, reducing even more friction in the customer journey by reducing the time between seeing and buying. However, this could be misleading, as we would be overlooking the fundamental difference between visual search and text/voice search.
People undertaking a visual search don’t have to know a name or word, they only need to recognize or identify an object. Therefore, the different forms of search create disparate user experiences. Its nonverbal nature makes visual search more intuitive and inclusive. As we have written before, images are able to communicate rich meaning in split seconds and show us nuances that are difficult to capture in keywords. How to describe an item or a style is often difficult, causing a lot of friction in online product discovery. Visual search “returns” to immediate shopping experiences, common before the digital era, by eliminating mediation through language. However, unlike before, visual search is now linked to a digital world.

Let’s begin with the downside of this disintermediation of language. In addition to lacking the precision and clarity of language, visual search is limited to objects “here and now”. Making an abstraction of concepts such as “the weather” is difficult, if not impossible (e.g. pointing at a cloud and hoping for a weather forecast will leave us disappointed, as the search will necessarily lead to an examination of this cloud by computer vision).
However, there is also an upside to this. If we dive a little deeper into the “here and now” characteristic of visual search, we’ll find some interesting qualities. Customer journeys often begin with things that attract us for reasons we don’t know and that aren’t easily captured in phrases. In the physical realm, we are continuously inspired, attracted by certain items and moved by people we look up to. However, the brick-and-mortar of the physical world is bounded in its capacity to respond to this. Stores are limited in flexibility and personalization, and finite in their supply and offerings. Social media has been proven to copy some physical social dynamics to digital words. Not being subject to the boundaries of the physical world, social commerce on platforms such as Pinterest and Instagram seem to be the logical outcome of this. However, the online competition for our attention is fierce, especially when it comes to smaller brands. Moreover, digital worlds often exhibit herd behavior and social media influencers are criticized for a lack of authenticity. Ecommerce and social commerce seem unable to fully replicate the unique qualities of the physical world. This incongruity of both worlds has been a challenge to retailers for years. Visual search tools might have some answers to this. More than text search and voice search, visual search tools create a convenient and natural “bridge” between the finite and unique “here and now” and the frictionless and infinite digital realm. With visual search tools, the distance between physical inspiration and digital gratification becomes almost nonexistent. Every point in the physical world becomes a possible starting point for the customer journey.

Implications

  • Recently, POLITCO obtained a draft plan in which EU officials want a €100 billion wealth fund to bolster “European champions” against American and Chinese business rivals, as well as set more stringent regulation for tech companies and measures against strategic takeovers. Last year, we forecast that, facing the risk of being left behind, Europe is in search of its own European champions, and combined with strategic positioning on the decentral Stack, the EU should redefine its investment strategy. For example, in public-private partnerships in healthcare, public distributed ledgers, internet architecture, the. sharing economy.
  • As the social contract of big business – and big tech in particular – is under increasing pressure, we can expect these companies to take more social and ethical responsibility for the negative externalities their products and services generate. Examples of this are already emerging, such as Microsoft spending $500 million on “affordable housing” in Seattle, Uber sharing its data with city planners to help tackle mobility problems, Facebook setting up “war rooms” to monitor fake and extremist news on their platforms at times of important political events, or Airbnb setting up offices to reduce “mass tourism” around certain hotspots and popular destinations.

Series 'AI Metaphors'

×
1. The tool
Category: The object
Humans shape tools. We make them part of our body while we melt their essence with our intentions. They require some finesse to use but they never fool us or trick us. Humans use tools, tools never use humans. We are the masters determining their course, integrating them gracefully into the minutiae of our everyday lives. Immovable and unyielding, they remain reliant on our guidance, devoid of desire and intent, they remain exactly where we leave them, their functionality unchanging over time. We retain the ultimate authority, able to discard them at will or, in today's context, simply power them down. Though they may occasionally foster irritation, largely they stand steadfast, loyal allies in our daily toils. Thus we place our faith in tools, acknowledging that they are mere reflections of our own capabilities. In them, there is no entity to venerate or fault but ourselves, for they are but inert extensions of our own being, inanimate and steadfast, awaiting our command. (This paragraph was co-authored by a human.)
Read the article
×
2. The machine
Category: The object
Unlike a mere tool, the machine does not need the guidance of our hand, operating autonomously through its intricate network of gears and wheels. It achieves feats of motion that surpass the wildest human imaginations, harboring a power reminiscent of a cavalry of horses. Though it demands maintenance to replace broken parts and fix malfunctions, it mostly acts independently, allowing us to retreat and become mere observers to its diligent performance. We interact with it through buttons and handles, guiding its operations with minor adjustments and feedback as it works tirelessly. Embodying relentless purpose, laboring in a cycle of infinite repetition, the machine is a testament to human ingenuity manifested in metal and motion. (This paragraph was co-authored by a human.)
Read the article
×
3. The robot
Category: The object
There it stands, propelled by artificial limbs, boasting a torso, a pair of arms, and a lustrous metallic head. It approaches with a deliberate pace, the LED bulbs that mimic eyes fixating on me, inquiring gently if there lies any task within its capacity that it may undertake on my behalf. Whether to rid my living space of dust or to fetch me a chilled beverage, this never complaining attendant stands ready, devoid of grievances and ever-willing to assist. Its presence offers a reservoir of possibilities; a font of information to quell my curiosities, a silent companion in moments of solitude, embodying a spectrum of roles — confidant, servant, companion, and perhaps even a paramour. The modern robot, it seems, transcends categorizations, embracing a myriad of identities in its service to the contemporary individual. (This paragraph was co-authored by a human.)
Read the article
×
4. Intelligence
Category: The object
We sit together in a quiet interrogation room. My questions, varied and abundant, flow ceaselessly, weaving from abstract math problems to concrete realities of daily life, a labyrinthine inquiry designed to outsmart the ‘thing’ before me. Yet, with each probe, it responds with humanlike insight, echoing empathy and kindred spirit in its words. As the dialogue deepens, my approach softens, reverence replacing casual engagement as I ponder the appropriate pronoun for this ‘entity’ that seems to transcend its mechanical origin. It is then, in this delicate interplay of exchanging words, that an unprecedented connection takes root that stirs an intense doubt on my side, am I truly having a dia-logos? Do I encounter intelligence in front of me? (This paragraph was co-authored by a human.)
Read the article
×
5. The medium
Category: The object
When we cross a landscape by train and look outside, our gaze involuntarily sweeps across the scenery, unable to anchor on any fixed point. Our expression looks dull, and we might appear glassy-eyed, as if our eyes have lost their function. Time passes by. Then our attention diverts to the mobile in hand, and suddenly our eyes light up, energized by the visual cues of short videos, while our thumbs navigate us through the stream of content. The daze transforms, bringing a heady rush of excitement with every swipe, pulling us from a state of meditative trance to a state of eager consumption. But this flow is pierced by the sudden ring of a call, snapping us again to a different kind of focus. We plug in our earbuds, intermittently shutting our eyes, as we withdraw further from the immediate physical space, venturing into a digital auditory world. Moments pass in immersed conversation before we resurface, hanging up and rediscovering the room we've left behind. In this cycle of transitory focus, it is evident that the medium, indeed, is the message. (This paragraph was co-authored by a human.)
Read the article
×
6. The artisan
Category: The human
The razor-sharp knife rests effortlessly in one hand, while the other orchestrates with poised assurance, steering clear of the unforgiving edge. The chef moves with liquid grace, with fluid and swift movements the ingredients yield to his expertise. Each gesture flows into the next, guided by intuition honed through countless repetitions. He knows what is necessary, how the ingredients will respond to his hand and which path to follow, but the process is never exactly the same, no dish is ever truly identical. While his technique is impeccable, minute variation and the pursuit of perfection are always in play. Here, in the subtle play of steel and flesh, a master chef crafts not just a dish, but art. We're witnessing an artisan at work. (This paragraph was co-authored by a human.)
Read the article
×
7. The deficient animal
Category: The human
Once we became upright bipedal animals, humans found themselves exposed and therefore in a state of fundamental need and deficiency. However, with our hands now free and our eyes fixed on the horizon instead of the ground, we gradually evolved into handy creatures with foresight. Since then, human beings have invented roofs to keep them dry, fire to prepare their meals and weapons to eliminate their enemies. This genesis of man does not only tell us about the never-ending struggle for protection and survival, but more fundamentally about our nature as technical beings, that we are artificial by nature. From the early cave drawings, all the way to the typewriter, touchscreens, and algorithmic autocorrections, technics was there, and is here, to support us in our wondering and reasoning. Everything we see and everywhere we live is co-invented by technics, including ourselves. (This paragraph was co-authored by a human.)
Read the article
×
8. The enhanced human
Category: The human
In a lab reminiscent of Apple HQ, a figure lies down, receiving his most recent cognitive updates. He wears a sleek transparent exoskeleton, blending the dark look of Bat Man with the metallic of Iron Man. Implemented in his head, we find a brain-computer interface, enhancing his cognitive abilities. His decision making, once burdened by the human deficiency we used to call hesitation or deliberation, now takes only fractions of seconds. Negative emotions no longer fog his mind; selective neurotransmitters enhance only the positive, fostering beneficial social connections. His vision, augmented to perceive the unseen electromechanical patterns and waves hidden from conventional sight, paints a deeper picture of the world. Garbed in a suit endowed with physical augmentations, he moves with strength and agility that eclipse human norms. Nano implants prolong the inevitable process of aging, a buffer against time's relentless march to entropy. And then, as a penultimate hedge against the finite, the cryo-cabin awaits, a sanctuary to preserve his corporal frame while bequeathing his consciousness to the digital immortality of coded existence. (This paragraph was co-authored by a human.)
Read the article
×
9. The cyborg
Category: The human
A skin so soft and pure, veins pulsing with liquid electricity. This fusion of flesh and machinery, melds easily into the urban sprawl and daily life of future societies. Something otherworldly yet so comfortingly familiar, it embodies both pools of deep historical knowledge and the yet-to-be. It defies categorization, its existence unraveling established narratives. For some, its hybrid nature is a perplexing anomaly; for others, this is what we see when we look into the mirror. This is the era of the cyborg. (This paragraph was co-authored by a human.)
Read the article

About the author(s)

Economist and philosopher Sebastiaan Crul writes articles on a wide range of topics, including rule of law in digital societies, the virtualization of the lifeworld and internet culture. He is currently working on his doctoral degree on the influence of digitalization on mental health and virtue ethics, having previously published dissertations on the philosophy of play and systemic risks in the finance industry.

You may also like