Lies, Damned Lies: The unreliable world of analytics and data science

The rumble in the AI jungle: Will the CDO or CIO emerge victorious in the race to implement GenAI?

December 14, 2023

Reading Time: 5 minutes

As we pass the first anniversary of the start of the most recent tech hype tsunami (farewell NFTs, we hardly knew ye), it’s starting to become a little clearer how many enterprises might start their journey with Generative AI – and it’s in areas that are the traditional domain of the CIO, rather than the CDO. So, does that mean that the CDO role will be sidelined in the rush to GenAI? Or should the CDO suit up and go head-to-head with the CIO to lay claim to this growing area? Can we expect a CDO-CIO Rumble in the Jungle?

The Great SQL bot Bake Off: Comparing the big LLM beasts on SQL code generation

July 25, 2023

Reading Time: 9 minutes

A side-effect of all the time I spend breathing the rarefied alpine air of the CDO community is that my SQL skills have become rather rusty. So I’ve been intrigued by the idea of using the code-generation capabilities of tools like ChatGPT and Bard to write SQL for me. But how good is the current crop of LLMs at creating SQL code that not only works, but generates the insight you’re actually looking for? I decided to find out.

Your stakeholder is (not) your father-in-law

June 16, 2023

Reading Time: 5 minutes

You arrive, slightly frazzled, a few minutes after the agreed time for lunch. At the door, you thrust the obligatory bottle of wine into your mother-in-law’s hand and say that you’re sorry you’re late; the traffic was terrible. “Which way did you come?” pipes up your father-in-law from the hallway, and you immediately realise that you have made a grave error.

As you stand, dazed, nodding along while nursing a room-temperature glass of Pinot Grigio, you find yourself thinking, “didn’t I just have to endure a conversation like this with the CMO this week?”

Ten ways to hit a home run with your data strategy

Published November 1, 2022; updated November 3, 2022

Reading Time: 9 minutes

If you’re a CDO (in either name or responsibility), chances are you’ve had to write a data strategy. If you haven’t, you may feel that everything would go much more smoothly if you were able to pull it out of your bag and wave it in the face of every naysaying executive stakeholder who dares to question your work, with a righteous cry of, “it’s in the data strategy!” Sadly, naysayers are not so easily swayed. But there are some things you can do to at least raise the chances of your data strategy causing the C-suite to fall gratefully in line.

Not dead yet: The long goodbye of third-party cookies

Published August 25, 2021; updated October 26, 2021

Reading Time: 10 minutes

The demise of cookies has been long foretold. Ever since they were invented in 1994, questions have been raised over their privacy implications and potential misuse; yet they have persisted, unloved but indispensable. However, it seems like the death knell has at last sounded for third-party cookies, with both Apple and Google finally taking concrete steps to rein in their use. But given that third-party cookies and mobile Ad IDs still underpin a huge amount of the web and its economics, what will life be like without them? And more importantly, will it actually be better than it was before?

Nasty, brutish and short: The life of the modern CDO

Published May 26, 2021; updated October 26, 2021

Reading Time: 6 minutes

The 2010s were a big decade for Chief Data Officers: from a standing start ten years ago, the CDO has risen to become an indispensable C-suite role, with almost two-thirds of Fortune 500 organizations hiring one.

But the role of CDO, especially outside the US, is still poorly defined, and CDOs are frequently not set up for success within their organizations. Is the job a poisoned chalice?

Demystifying Data Science, Part V: AutoML

Published March 30, 2020; updated April 20, 2020

Reading Time: 7 minutes

As we’ve established earlier in this post series, Data Science is a process, with quite a lot of repetitive elements. Many Data Science projects involve a familiar set of tasks to identify, clean and prepare data, before finding the best model for the scenario at hand. And despite the mystique around the whole profession, many Data Scientists spend a lot of time complaining about all this repetitive work. But any repetitive process is ripe for automation, and Data Science is no exception. Enter the field of “AutoML”.

Google’s Ban on Third-party Cookies Could Actually Harm User Privacy

Published January 28, 2020; updated April 20, 2020

Reading Time: 3 minutes

There was quite a lot of coverage earlier this month when Google announced that they would be phasing out support for third-party cookies in Chrome within the next two years. The stock price of firms like Criteo, which rely heavily on third-party cookie data for their core business, dipped sharply. The general consensus has been that this was a welcome move in terms of user privacy – but nixing third-party cookies could actually harm user privacy, by making it harder to identify irresponsible sharing of user data.

Demystifying Data Science, Part IV: Models and Machine Learning

Published July 2, 2019; updated April 20, 2020

Reading Time: 9 minutes

As I mentioned in my first post in this series, the central purpose of Data Science is to find patterns in data and use these patterns to make useful predictions about the future. It’s this predictive part of Data Science which gives the discipline its mystique; even though Data Scientists actually only spend a relatively small fraction of their time on this area compared to the more workaday activities of loading, cleaning and understanding the data, it’s the step of building predictive models which unlocks the value hidden within the data.