申请试用
HOT
登录
注册
 
Massive-Scale Entity Resolution Using the Power of Apache Spark and Graph

Massive-Scale Entity Resolution Using the Power of Apache Spark and Graph

Spark开源社区
/
发布于
/
6635
人观看
Spark’s graph capabilities are great at enabling analysis of networks for use-cases such as fraud-detection, illicit network detection, and supply chain risk analysis. However, in order for a data scientist to perform analytics on a network (e.g., Page Rank, community detection, etc.), they end up spending all their time fighting a mountain of data integration challenges. A specific challenge this talk will focus on is connecting entities in a network within and across data domains. We will explore how you can leverage the Spark ecosystem’s graph capabilities to perform massive-scale entity resolution (ER). As a result, your data scientists will be able to more quickly and effectively perform graph analytics that drive business and mission value. Key takeaways: 1) The Spark ecosystem enables you to quickly get started with graph analytics use-cases at scale 2) Complementing traditional ER techniques with the context of graph relationships allows you to connect entities that you could not easily connect before
2 点赞
1 收藏
4下载
确认
3秒后跳转登录页面
去登陆