Data Team & Data Strategy : Why ?

Ever wondered why organizations need a robust Data Strategy or even a Data team in the first place ?
Well, wonder no longer : at the InvestOps Asia conference in Singapore, I had the opportunity of addressing this crucial question among several others in a fireside chat with Jason Inzer on Harnessing the Latest Data Innovations to build a Holistic Data Strategy.

Here’s what I had to say on the topic :

๐Ÿ” Data team & Data Strategy : Why ?
Letโ€™s rewind 15 years. Organizations operated with various departmentsโ€”Front Office, Compliance, IT, etc.โ€”each with a primary function, yet burdened by data as a byproduct.
Take IT as an example : they manage infrastructure and the applications that run on top; these happen to use and/or produce data. This is the byproduct of the “main” function. If you push the reasoning a bit further (operators may key in erroneous data, crashing systems), it can be argued to be a liability.
This is why Data Team started to appear, and that is the goal of a Data Strategy : take all these siloed liabilities, and turn them in an asset for your organization through transversal management.

๐ŸŽฏ The Core Challenges:
1. Understanding the Transformation: This is not a technical upgrade but an organizational transformation. It requires altering how everyone in the organization works, and driving this change through the “processes, people, tools” triangle at the same time.
2. Clarity of Purpose: Know why youโ€™re embarking on this journey. Avoid being driven by external pressures like regulations or trends (“CEO wants to do GenAI !”). Focus on defining extremely well the problems within your organization you want to address.
3. Empowering the CDO: Your Chief Data Officer will ask a lot of changes to a lot of people. In most (any ?) organization, this is a political minefield. Ensure they are positioned within decision-making centers and can sell their strategy and roadmap effectively.

Tackling siloed data in a firm-wide data strategy can feel like trying to piece together a puzzle with mismatched pieces: how do you bring together disparate processes, data stores, and analytics into a cohesive strategy?

๐Ÿ’ก Key Considerations:
– Start Small, Think Big: You cannot change everything everywhere all at once. Begin with a strategic area where success is likely, bring that within your framework (remember “people, processes & tools” !) and then learn/improve/iterate to expand to other chosen areas.
– Empower, Don’t Dictate: Instead of imposing changes, use proven principles (think hashtag#Agile) to create “freedom within boundaries.” Help the teams, add value to their day to day, not constraints.
Let me share an example from a previous life:
A trading chain was made up of numerous integrated systems. This created a persistent issue with tracking trades across this complex web (particularly, making sure no trade was slipping through the cracks). My team stepped in, not just providing a technical solution using Azure Synapse, but more importantly working with the business aligned IT team to build together a strategy-compliant, data-driven approach to solve this problem (teach a man how to fish vs give the man a fish). I truly believe this collaborative effort is what will scale a data strategy within an organization.

I have this sentence saying that “If only data people do data work, you’re doing it wrong.” To me, it encapsulates the essence of a successful data strategy: it’s a balance of top-down guidance and bottom-up empowerment. Cast a wide net with minimal constraints initially, and gradually normalize and standardize processes collaboratively.

๐Ÿ”ง Quick Tools & Frameworks Mention:
Data Modeling: Unify data models across silos for aggregatable data.
Data Lake: Utilize technologies like data lakes for flexible, scalable data management.

Next, weโ€™re diving into the heroes of data managementโ€”Data Quality and Data Governanceโ€”and why theyโ€™re the building blocks of a successful Data Strategy.

๐Ÿ’ก Why Data Quality and Data Governance Matter:

๐˜ฟ๐™–๐™ฉ๐™– ๐™ž๐™จ ๐™ฉ๐™๐™š ๐™ฃ๐™š๐™ฌ ๐™ค๐™ž๐™ก: this phrase has been overused to the point it became meaningless. Its forgotten core truth remains: Your organization needs to ๐˜ต๐˜ณ๐˜ถ๐˜ญ๐˜บ treat its data as an asset.
Think about how a financial institution meticulously manages its trading books :
โ€ข Operations department controls and reconciles them daily against their source of truth (Prime Brokers, etc.)
โ€ข Risk department checks them against predetermined risk limits
โ€ข Finance department controls and certifies them monthly/quarterly/yearly

Data needs to be treated in the same way, and that is the purpose of ๐——๐—ฎ๐˜๐—ฎ ๐—š๐—ผ๐˜ƒ๐—ฒ๐—ฟ๐—ป๐—ฎ๐—ป๐—ฐ๐—ฒ.
To say it simply, you want your data assets to be 1๏ธโƒฃ ๐——๐—ถ๐˜€๐—ฐ๐—ผ๐˜ƒ๐—ฒ๐—ฟ๐—ฎ๐—ฏ๐—น๐—ฒ, 2๏ธโƒฃ ๐—ง๐—ฟ๐˜‚๐˜€๐˜๐—ฎ๐—ฏ๐—น๐—ฒ, and 3๏ธโƒฃ ๐—ฆ๐—ฎ๐—ณ๐—ฒ๐—น๐˜† ๐—จ๐˜€๐—ฎ๐—ฏ๐—น๐—ฒ :
โ€ข Policies and Processes: Define the framework that all departments should respect
โ€ข Empowerment and Responsibility: Distribute roles and responsibilities across your organization to ensure everyone is accountable for these new tasks.
โ€ข People Over Tools: Successful data governance is about empowering your team, not just deploying tools. Tools help automate and streamline your new processes, but people make the difference.

Data Quality is a critical component of Data Governance, focusing on ๐—•๐˜‚๐—ถ๐—น๐—ฑ๐—ถ๐—ป๐—ด, ๐—˜๐—ป๐˜€๐˜‚๐—ฟ๐—ถ๐—ป๐—ด ๐—–๐—ผ๐—ป๐˜๐—ถ๐—ป๐˜‚๐—ฒ๐—ฑ, and ๐——๐—ฒ๐—บ๐—ผ๐—ป๐˜€๐˜๐—ฟ๐—ฎ๐˜๐—ถ๐—ป๐—ด Trust in your data.
Assigning roles like Data Owners and Data Stewards (and empowering people for this responsibility) will maintain data hygiene and transparency throughout your organization.

โš™๏ธ Frameworks and Resources:
For those looking to dive deeper, consider exploring the DAMA (Data Management Association) DMBOK (Data Management Body of Knowledge) framework.



Finally, let’s explore some important considerations to strike a balance between Data Access, Privacy, and Security, and why a Data Catalog is essential.

๐Ÿ๏ธ ๐—ง๐—ต๐—ฒ ๐—œ๐—ฑ๐—ฒ๐—ฎ๐—น ๐—ฆ๐—ฐ๐—ฒ๐—ป๐—ฎ๐—ฟ๐—ถ๐—ผ:
Imagine a central hub where everyone in your organization can discover data assets, evaluate their suitability for their purpose, and request access safely, compliantly, and efficiently.
Think about all the problems you could solve, and the all data products you could build!

Letโ€™s break down how to get started.

1๏ธโƒฃ ๐—ž๐—ป๐—ผ๐˜„ ๐—ฌ๐—ผ๐˜‚๐—ฟ ๐—”๐˜€๐˜€๐—ฒ๐˜๐˜€
– ๐™๐™š๐™œ๐™ž๐™จ๐™ฉ๐™ง๐™ฎ ๐™–๐™ฃ๐™™ ๐™ˆ๐™š๐™ฉ๐™–๐™™๐™–๐™ฉ๐™–: Start with a “registry” (= DataCatalogue) of your main data assets, including data that describes your data (= Metadata).
Donโ€™t stop at datasetsโ€”catalog everything from applications to hardware (or build links to your existing registries).
While you want to eventually target for exhaustivity, I recommend starting by a smaller domain within your organization, do a thorough job in metadata definition and asset identification, instead of trying to address all domains averagely.
– ๐™๐™ก๐™š๐™ญ๐™ž๐™—๐™ž๐™ก๐™ž๐™ฉ๐™ฎ ๐™ž๐™จ ๐™†๐™š๐™ฎ: Especially in Asia, and in regulated industries like finance, you must comply with widely different regulatory views.
Your metadata must therefore be able to adapt, and answer many questions, e.g. does a dataset contains Personal Identifiable Information (PII), does it contain client data, from which geography, etc.
Without this, managing data will stay a manual task, error-prone and (unsustainably) time consuming.

2๏ธโƒฃ ๐—–๐—ผ๐—ป๐˜๐—ฟ๐—ผ๐—น๐—น๐—ฒ๐—ฑ ๐—”๐—ฐ๐—ฐ๐—ฒ๐˜€๐˜€ ๐—ฎ๐—ป๐—ฑ ๐—จ๐˜€๐—ฎ๐—ด๐—ฒ ๐—ช๐—ผ๐—ฟ๐—ธ๐—ณ๐—น๐—ผ๐˜„๐˜€
– ๐™๐™ก๐™š๐™ญ๐™ž๐™—๐™ก๐™š ๐˜พ๐™ค๐™ฃ๐™ฉ๐™ง๐™ค๐™ก๐™จ: Implement diverse access controls to meet the various regulatory requirements.
For the same reasons mentioned above, flexibility here is crucial to ensure compliance.
– ๐˜ผ๐™ช๐™™๐™ž๐™ฉ๐™–๐™—๐™ž๐™ก๐™ž๐™ฉ๐™ฎ ๐™–๐™ฃ๐™™ ๐™๐™š๐™ฅ๐™ค๐™ง๐™ฉ๐™ž๐™ฃ๐™œ: Regulations often require detailed logs and reports on data access and usage (think GDPR for example).
Capture every request, workflow, and usage to maintain transparency and compliance.

๐Ÿ’ก ๐—ง๐—ต๐—ฒ ๐—•๐—ถ๐—ด๐—ด๐—ฒ๐—ฟ ๐—ฃ๐—ถ๐—ฐ๐˜๐˜‚๐—ฟ๐—ฒ:
Embedding these processes into your daily operations across departments is your only option for scalability.
Start smallโ€”choose a department where you can make a significant impact and use it as a pilot to learn from, then refine and iterate.
As said in previous posts, these operations need to be part of your daily processes, and not an extra responsibility handled by a “data team” on the side.

By doing so, you’ll not only improve data management but also help break down organizational silos, creating a more cohesive and efficient data-driven culture.

Leave a Comment

Your email address will not be published. Required fields are marked *