Navigating the complicated landscape of cloud storage

Published: June 2nd, 2021

It goes without saying that 2020 was an unforgettable year. It was unforgettable in a different way for the major cloud service providers, all of which experienced an impressive surge in demand. Market leader AWS closed out 2020 with revenues of $45.3 billion, up nearly 30% year-over-year and more than $13.5 billion in annual operating profits—which is 63% of Amazon’s total operating profits for the year.

Roughly 50 percent of all corporate data is stored in the cloud, according to Statista.

Storing data in a cloud service eliminates the need to purchase and maintain data storage infrastructure, since infrastructure resides within the data centers of the cloud IaaS provider and is owned and managed by the provider. Beyond cost savings, cloud storage provides valuable flexibility for data management. IT organizations are increasing data storage investments in the cloud for backups and data replication, data tiering and archiving, data lakes for artificial intelligence (AI) and business intelligence (BI) projects, and to reduce their physical data center footprint.

Just as with on-premises storage, in the cloud, you can purchase different levels of storage based on whether the data is hot (accessed frequently) or cold. This way, you are not overpaying for storing data that is needed only for archives or for very occasional access. You can use data management solutions to set up policies and automatically move data to the right cloud storage class based on parameters such as age, owner and cost.

The leading use case for cloud storage today is handling the petabytes of unstructured data that enterprises are amassing: file data from many different applications such as genomic sequencing, electric cars, bodycam videos, Internet of things (IoT), seismic analysis and collaboration tools. Migrating file data to the cloud is hard because it can take a long time and entails unique requirements regarding access controls and security. Depending on the type and volume of data you wish to move to the cloud, you will need to adjust your strategies appropriately.

Here are considerations as you evolve your cloud data management strategy to avoid getting burned on cost and performance:

Secondary storage tier gotchas. Enterprise IT organizations are increasingly seeing the value of the cloud as a secondary or tertiary storage tier because it frees up space on expensive on-premises storage and allows you to leverage the cloud for AI and analytics. However, it’s easy to get burned when a storage vendor writes data to the cloud in a proprietary format. Data in non-native format must be read through the vendor’s application before use, making it difficult for other applications to use. As well, in some cases the data must be rehydrated to the source and then moved before use. Ensure that you understand the limitations of moving your data to the cloud and if it’s in a format that is acceptable to common use cases.
Managing shadow IT. It’s true: shadow IT is no longer a dirty word. But opening up the cloud to your workforce without guardrails can get messy quickly. Conversely, by creating a well-defined strategy and data governance process for the cloud, you can minimize the negative effects of shadow IT while still allowing employees to experiment safely with approved apps and services.
A worsening problem of data islands. The cloud, for all its merits, has added data silos – made even more scattered by the multi-cloud movement. Clouds have different storage classes and tiers for file and object storage, all of which need to be leveraged for a cost-effective file data management strategy. These result in more silos to manage. Regardless, hybrid IT is here to stay for most midsize-to-large enterprises and it means that IT leaders need to determine how to get a central view and management plan for data and assets. This doesn’t mean that you need to store all the data in one place, but you will need visibility to move data and workloads around as needed based on cost, performance and/or business requirements.
Hidden costs. The challenges of cloud sprawl and VM sprawl have been known for quite some time. Moving to the cloud requires constant oversight to ensure that you aren’t wasting money with unused or ill-used resources. Another issue, however, is making sure that file data is managed and tiered appropriately; don’t manage cold and hot data the same way or you will take it on the chin with nasty egress fees and unnecessary API costs. A large government agency was recently in the news for spending millions of dollars on egress fees as the data they moved to the cloud was in fact accessed frequently: Ouch. Understand your data, and all the areas where the cloud can bite you. Be sure to talk to your IT vendors about these risks and how to avoid them.
Skills, skills! Yes, the talent gap remains large in technology, so IT leaders must always factor this into the equation when making dramatic changes in strategy. A recent CompTIA survey found that 74% of large firms will be hiring for IT and technology roles in 2021, with a particular focus on advanced infrastructure, AI and data science, and people skills for remote collaboration.
Unrealistic expectations for savings. Over the long haul, an organization can easily save on cloud storage versus maintaining a lot of technology inside the corporate data center. But this requires a well-defined data strategy. It’s better to think about the benefits of moving from a CapEx to a more predictable OpEx spending model, without the hidden intangible expenses that occur from traditional IT. As you optimize cloud infrastructure, you won’t have to worry about expensive hardware sitting in your data center, cooling costs, regular fire drills and the hassle of maintaining and securing everything.

Thinking for the long term

There is untold value in the massive amounts of unstructured data that organizations are storing; some estimates report only 1% to 2% of this data is actually being used. Have the necessary conversations with your vendors, consultants and in-house stakeholders to clearly understand all of your data assets: where it resides, who’s using it and how often, and its strategic value to the organization. By gathering this information, you will be able to make informed decisions about your data and where it should live. These decisions will evolve with business needs, so ensure that you have the means to continually analyze your assets and adjust your strategy as needed.

Article Tags

cloud, cloud storage, komprise

About Michael Del Castillo

Michael Del Castillo is a solutions engineer at data management solutions provider Komprise.

View all posts by Michael Del Castillo

Cookie	Duration	Description
cookielawinfo-checkbox-advertisement	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Advertisement" category .
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
CookieLawInfoConsent	1 year	Records the default button state of the corresponding category & the status of CCPA. It works only in coordination with the primary cookie.
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Cookie	Duration	Description
__atuvc	1 year 1 month	AddThis sets this cookie to ensure that the updated count is seen when one shares a page and returns to it, before the share count cache is updated.
__atuvs	30 minutes	AddThis sets this cookie to ensure that the updated count is seen when one shares a page and returns to it, before the share count cache is updated.

Cookie	Duration	Description
__gads	1 year 24 days	The __gads cookie, set by Google, is stored under DoubleClick domain and tracks the number of times users see an advert, measures the success of the campaign and calculates its revenue. This cookie can only be read from the domain they are set on and will not track any data while browsing through other sites.
_ga	2 years	The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.
_ga_WTGVKVXEZJ	2 years	This cookie is installed by Google Analytics.
_gat_gtag_UA_107693958_2	1 minute	Set by Google to distinguish users.
_gid	1 day	Installed by Google Analytics, _gid cookie stores information on how visitors use a website, while also creating an analytics report of the website's performance. Some of the data that are collected include the number of visitors, their source, and the pages they visit anonymously.
_jsuid	1 year	This cookie contains random number which is generated when a visitor visits the website for the first time. This cookie is used to identify the new visitors to the website.
at-rand	never	AddThis sets this cookie to track page visits, sources of traffic and share counts.
CONSENT	2 years	YouTube sets this cookie via embedded youtube-videos and registers anonymous statistical data.
iutk	5 months 27 days	This cookie is used by Issuu analytic system to gather information regarding visitor activity on Issuu products.
uvc	1 year 1 month	Set by addthis.com to determine the usage of addthis.com service.

Cookie	Duration	Description
IDE	1 year 24 days	Google DoubleClick IDE cookies are used to store information about how the user uses the website to present them with relevant ads and according to the user profile.
loc	1 year 1 month	AddThis sets this geolocation cookie to help understand the location of users who share the information.
mc	1 year 1 month	Quantserve sets the mc cookie to anonymously track user behaviour on the website.
test_cookie	15 minutes	The test_cookie is set by doubleclick.net and is used to determine if the user's browser supports cookies.
VISITOR_INFO1_LIVE	5 months 27 days	A cookie set by YouTube to measure bandwidth that determines whether the user gets the new or old player interface.
YSC	session	YSC cookie is set by Youtube and is used to track the views of embedded videos on Youtube pages.
yt-remote-connected-devices	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt-remote-device-id	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt.innertube::nextId	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.
yt.innertube::requests	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.

Cookie	Duration	Description
__gpi	1 year 24 days	No description
_heatmaps_g2g_101137905	10 minutes	No description
cf_7167_id	20 years	No description
cf_7167_person_last_update	session	No description
GoogleAdServingTest	session	No description
prism_252377639	1 month	No description
querylyvid	3 months	No description
xtc	1 year 1 month	No description

Navigating the complicated landscape of cloud storage

Article Tags

Subscribe to SDTimes

About Michael Del Castillo

Related Articles

AI Governance is the Next IT Battleground

Snowflake to acquire Observe to improve its ITOM position, help customers proactively troubleshoot issues

AWS outage highlights risks of single cloud deployments

Four trends reshaping Kubernetes platform engineering