SDS 485: Financial Data Engineering — with Doug Eisenstein

Topics covered
Popular Clips
Episode Highlights
Resolution Issues
Entity resolution in the financial sector is fraught with complexities due to diverse data sources and identifiers. highlights the challenge of matching identifiers across various financial instruments like indices, ETFs, and futures contracts, which provide unique metrics but require connection for comprehensive data coverage 1. This issue becomes particularly pronounced as companies transition from fundamental to systematic, data-driven trading, necessitating robust entity extraction and resolution processes 2. Doug emphasizes the importance of capturing descriptive characteristics like name, country, and currency to effectively link entities over time 1.
We've seen this as a huge problem. And that's one of the areas that we find is the most challenging for customers to really get right.
---
The ability to resolve these entities is crucial for investment managers dealing with multi-asset classes, as they must navigate data from various sources and adapt to changes like mergers and acquisitions 2.
Extraction Insights
Entity extraction involves identifying and linking specific entities within financial data sets, a process crucial for making informed investment decisions. explains that this involves scanning documents like SEC filings to extract important nouns, such as company names or financial instruments, and linking them using structures like knowledge graphs 3. Doug discusses products developed at Aristos, such as Finflow and Dominus, which address these challenges by managing entity data and linking disparate data sources 4.
It's the way you describe it is, right? Like, you have to be. You're searching through documents, through structured, through structured data, semi structured, unstructured data.
---
These tools help streamline data flow and ensure accurate entity recognition, which is essential for navigating the complexities of financial data and making sound investment choices 4.
Related Episodes


SDS 595: Data Engineering 101 — with Joe Reis and Matt Housley
Answers 383 questions

SDS 587: Data Engineering for Data Scientists — with Mark Freeman
Answers 383 questions

SDS 437: Data Science at a World-Leading Hedge Fund — with Claudia Perlich
Answers 383 questions

SDS 535: How to Found, Grow, and Sell a Data Science Start-up — with Austin Ogilvie
Answers 383 questions

SDS 601: Venture Capital for Data Science — with Sarah Catanzaro
Answers 383 questions

SDS 477: How to Thrive as an Early-Career Data Scientist — with Sidney Arcidiacono
Answers 383 questions

SDS 619: Tools for Deploying Data Models into Production — with Erik Bernhardsson
Answers 383 questions

SDS 433: Data Science Trends for 2021 — with Ben Taylor
Answers 383 questions

SDS 435: Scaling Up Machine Learning — with Erica Greene
Answers 383 questions
SDS 468: The History of Data — with Jon Krohn
Answers 383 questions

SDS 615: How to Ace Your Data Science Interview — with Nick Singh
Answers 383 questions

SDS 509: Accelerating Start-up Growth with A.I. Specialists — with Parinaz Sobhani
Answers 383 questions

SDS 561: Engineering Data APIs — with Nate Fox
Answers 383 questions













