For most retailers, the fourth quarter of the year is crucial. More specifically, the days and weeks right after Thanksgiving in the US can make or break a business. From Black Friday (the day after Thanksgiving) through Cyber Monday (the following Monday) to Cyber Week (the entire week after), this short period of intensive shopping is seen as the retail industryâs Super Bowl.
This is particularly true in the world of e-commerce. According to TechCrunch, US shoppers bought $6.95 billion worth of stuff online last Cyber Monday alone. And at the peak of Black Fridayâs frenzy, shoppers were spending upwards of $1 million per minute online, which is why retailers spend their entire year preparing for this quarter. Promotions are planned, emails are prepped, new products are added, sites are updated, and merchandising is carefully mapped out.
Failure is expensive
Unfortunately, even with all that planning, site outages on these critical days are still common. And the resulting losses can be substantial, as some estimates claim that up to eight percent of the dayâs sales can be lost for each hour a site is down can be lost.
E-commerce infrastructure should be instrumented for quick debugging and troubleshooting, so that engineers can spot a problem and fix it fast.
A simple Google search of âBlack Friday crashes 2017â shows some of the recent carnage. Last year, Macyâs and Loweâs were in the crosshairs. A couple years prior, it was Target and Paypal. So even the biggest names arenât immune, but why?
Retailâand e-commerce in particularâis tougher than itâs ever been. The customer journey has changed immensely over the past several years, and most e-commerce platforms simply werenât built for the kind of complexity weâre seeing today. Being a large, established retailer or tech firm may not be an advantage, unless youâve updated and adapted your systems to account for todayâs increasingly mobile, and global, consumers.
Whoâs at risk in 2018?
As much fun as it would be, I canât give you an advance list of retail sites that are going to crash this year (sorry). But I can tell you the key characteristics of systems that can better handle heavy loads and high traffic. After the Cyber Week smoke clears, the merchants left standingâand counting their cashâwill be the ones whose e-commerce infrastructure was designed and built with these principles in mind.
- Scalability: Infrastructure should be able to grow vertically (increasing the power of the machines) or horizontally (adding new machines) without loss of performance or reliability. Cloud services such as AWS provide this in the form of auto-scaling groups and containerized services and applications.
- Elasticity: Thatâs the capacity of an infrastructure to grow horizontally in an automated way. The old-fashioned alternative to this is over-provisioning capacity, or paying for more server power than you actually need outside of peak traffic periods. Now, all major cloud computing solutions can provide elasticity.
- Observability: E-commerce infrastructure should be instrumented for quick debugging and troubleshooting, so that engineers can spot a problem and fix it fast. (No âblack boxes!â) Robust systems include clear monitoring, logging, and alerting for key system events, in anticipation of problems.
Â Start doing these four things, now
Focus on these concrete actions you can take to make future holiday seasons less stressful.
1. Test, test, test.
The key to avoiding a crash is understanding what your platform can handle. If you send an email blast to your entire list, can your shop deal with the resulting spike in traffic? How about if your promotion gets syndicated on a major coupon site? In other words, how quickly can your systems scale when traffic and orders peak? Good load testing requires thoughtful setup and frequent re-testing. There are services (such as Blazemeter and LoadImpact) to do it for you if you donât have the expertise in-house.
2. Build for speed.
Good testing should reveal your weak points. Is it your database? Do images load slowly? Site speed is the single most important feature. Apart from signaling that all might not be well with your infrastructure, a slow website is terrible for your business. Every fraction-of-a-second delay hurts conversions, and sites that take longer than a couple of seconds to load could chase more than half your shoppers away.
3. Have a backup plan.
If you do crash, how quickly can you come back online?How resilient and redundant are your systems? Is your data intact? Have you practiced a full restore? If you are on a third-party commerce platform, do you know who to call 24/7/365? Do you know what your service providerâs SLAs [service-level agreements] actually cover? If you arenât in control of your platform, youâd better know what type of help you can expect during these critical outages.
4. Get comfy with continuous updates.
Know this, accept it, and make it a strength instead of a weakness: When systems are pushed to their limits, something will go wrong. You will need to make an update, fix a problem, and deploy new code. E-commerce giants like Amazon have built continuous integration and delivery into their daily engineering practice, to the point that they deploy new code every few seconds, without their sites ever going dark. This is the new reality, and we all need to adapt to it.
Buckle up and enjoy the ride
A couple weeks from now, weâll be reading post-mortem reports about this yearâs devastating crunch-time crashes. Someone will go down. Itâs been the reality every year. Hereâs hoping itâs not you (fingers crossed). However, if youâve been under-investing in the areas I described above, your time may be coming. Huddle up with your key technical decision-makers and review your infrastructure and engineering practices.
Reaction Commerce provides an open-source e-commerce platform.
AddSearch Custom Site Search