Disrupting data discovery

Bibliographic Details
Corporate Author:	O'Reilly (Firm)
Other Authors:	Grover, Mark, on-screen presenter; Feng, Tao, on-screen presenter
Format:	Online video
Language:	English
Published:	[Place of publication not identified] : O'Reilly Media 2019.
Subjects:	Lyft (Firm); Strata Conference (2019 : San Francisco, California); Business enterprises > Computer networks; Decision making > Data processing; Electronic data processing > Management.
View at Universitat Ramon Llull Library:	https://discovery.url.edu/permalink/34CSUC_URL/1im36ta/alma991009822782906719

Description
Summary:	"Lyft has reduced the time it takes to discover data by 10x by building its own data portal, Amundsen. Amundsen is built on three key pillars: an augmented data graph, an intuitive user experience, and centralized metadata. Amundsen uses a graph database under the hood to store relationships between various data assets (tables, dashboards, protobuf events, etc.). What's unique to Amundsen is that it treats people as a first-class data asset; in other words, there's a graph node for each person in the organization that connects to other nodes (like tables, and dashboards). In addition, Amundsen runs PageRank using data from access logs to power search ranking, similar to how Google ranks web pages on the internet. Finally, Amundsen gathers metadata from various different sources (Hive, Presto, Airflow, etc.) and exposes it in one central place. The right place to store all this metadata is a work in progress. Mark Grover and Tao Feng (Lyft) offer a demo of Amundsen and lead a deep dive into its architecture, covering how it leverages centralized metadata, page rank, and a comprehensive data graph to achieve its goal. They also explore the future roadmap, unsolved problems, and its collaboration model. This session was recorded at the 2019 O'Reilly Strata Data Conference in San Francisco."--Resource description page.
Notes:	Title from title screen (viewed January 20, 2020).
Physical Description:	1 online resource (1 streaming video file (42 min., 6 sec.)) : digital, sound, color
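
Illustrative note: the summary mentions a data graph in which people, tables, and dashboards are all nodes, with PageRank computed from access logs to rank search results. The following is a minimal sketch of that general idea, not Amundsen's actual implementation. The node names, the edge list standing in for access-log data, and the `pagerank` helper are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def pagerank(edges, damping=0.85, iterations=50):
    """Plain power-iteration PageRank over a directed edge list.

    This is a generic textbook formulation, assumed here for illustration;
    it is not taken from Amundsen's code base.
    """
    nodes = sorted({node for edge in edges for node in edge})
    out_links = defaultdict(list)
    for src, dst in edges:
        out_links[src].append(dst)

    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        # Rank held by nodes without outgoing edges is spread evenly.
        dangling = sum(rank[node] for node in nodes if not out_links[node])
        new_rank = {
            node: (1.0 - damping) / n + damping * dangling / n
            for node in nodes
        }
        for src in nodes:
            targets = out_links[src]
            for dst in targets:
                new_rank[dst] += damping * rank[src] / len(targets)
        rank = new_rank
    return rank


# Hypothetical edges derived from access logs: "who or what reads what".
# People are nodes alongside tables and dashboards.
access_edges = [
    ("person:alice", "table:rides"),
    ("person:bob", "table:rides"),
    ("person:bob", "dashboard:weekly_kpis"),
    ("dashboard:weekly_kpis", "table:rides"),
    ("person:carol", "table:drivers"),
]

ranks = pagerank(access_edges)
top_asset = max(ranks, key=ranks.get)
for node, score in sorted(ranks.items(), key=lambda kv: -kv[1]):
    print(f"{node}: {score:.3f}")
```

In this toy graph the most frequently accessed table ends up with the highest score, which is the kind of signal a search ranker could use to order results.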
