Data + AI Summit 2022 于2022年06月27日至30日举行。本次会议是在旧金山进行,中国的小伙伴是可以在线收听的,一共为期四天,第一天是培训,后面几天才是正式会议。本次会议有超过200个议题,演讲嘉宾包括业界、研究和学术界的专家,本次会议主要分为六大块:

数据分析, BI 以及可视化:了解最新的数据分析、BI 和可视化技术以及客户和社区的解决方案。•数据工程:从实现数据管道到管理数据质量、ETL和数据质量框架再到数据 ops,深入了解最新的数据工程知识。•Data Lakes, Data Warehouses and Data Lakehouses:了解数据湖和数据仓库演变为 Data Lakehouses 背后的概念和最佳实践;•数据科学, 机器学习以及 MLOps:了解关于生产数据科学和机器学习管道的技术和最佳实践。•数据安全和治理:•学术研究:致力于学术和先进的工业研究领域,包括大规模调度程序,图表,数据分析和机器学习系统。

会议的全部日程请参见:https://databricks.com/dataaisummit/agenda

如果想及时了解Spark、Hadoop或者HBase相关的文章,欢迎关注微信公众号:过往记忆大数据

本次会议的第一天 KeyNote 宣布了几件重要的事情:Apache Spark 后续发展、下一代 Structured Streaming 解决方案、Delta Lake 的功能全部开源。除了第一天的 KeyNote,下面几个议题也推荐大家看看:

•Apache Spark SQL Aggregate Improvement at Meta (Facebook)

•Recent Parquet Improvements in Apache Spark

•Spark Data Source V2 Performance Improvement: Aggregate Push Down

•Deep Dive into the New Features of Apache Spark 3.2 and 3.3

•Managing Straggler Executors at Apache Spark 3.3

•Apache Spark on Kubernetes—Lessons Learned from Launching Millions of Spark Executors

•PySpark in Apache Spark 3.3 and Beyond

•Delta Lake 2.0 Overview

•Improving Interactive Querying Experience on Spark SQL

•Moving from Apache Spark 2 to Apache Spark 3: Spark Version Upgrade at Scale in Pinterest

•Radical Speed on the Lakehouses: Photon under the hood

超清视频下载途径

考虑到大家可能对不同的主题感兴趣,这里给大家整理了所有可以下载的视频,全部是超清,大家可以根据自己的兴趣去下载观看。另外,会议的 PPT 当前还不可以下载,需要 PPT 的同学可以继续关注本公众号,获取相关消息。

关注微信公众号 过往记忆大数据 或者 Java与大数据架构 并回复 10187 获取 Data + AI Summit 2022 超清视频。

可下载视频的议题

本次可下载视频的议题共 197 个。

•A Low-Code Approach to 10x Data Engineering•A Modern Approach to Big Data for Finance•A Practitioner's Guide to Unity Catalog—A Technical Deep Dive•Accelerating Hybrid Data Mesh Implementation•Accidentally Building a Petabyte-Scale Cybersecurity Data Mesh in Azure With Delta Lake at HSBC•Administrator Best Practices and Tips for Future-proofing your Databricks Account•Advanced Migrations From Hive to SparkSQL•Adversarial Drifts Model Monitoring and Feedback Loops Building Human-in-the-Loop Machine Learning Systems for Content Moderation•An Advanced S3 Connector for Spark to Hunt for Cyber Attacks•Analyzing Population Health using Healthcare Claims•Apache Arrow Flight SQL: High Performance, Simplicity, and Interoperability for Data Transfers•Apache Spark SQL Aggregate Improvement at Meta Facebook•Apache Spark on Kubernetes—Lessons Learned from Launching Millions of Spark Executors•Automate Your Delta Lake or Practical Insights on Building Distributed Data Mesh•Automating Model Lifecycle Orchestration with Jenkins•Backfill Streaming Data Pipelines in Kappa Architecture•Batches Streams and Everything in between Unifying Batch and Stream Storage with Apache Pulsar and Lakehouse Architectures•Beyond Daily Batch Processing Operational Trade-Offs of Microbatch Incremental and Real-Time Processing for Your ETLs (and Your Team's Sanity)•Beyond Monitoring The Rise of Data Observability•Build an Enterprise Lakehouse for Free with Trino and Delta Lake•Building Enterprise Scale Data and Analytics Platforms at Amgen•Building Metadata and Lineage Driven Pipelines on Kubernetes•Building Production-Ready Recommender Systems with Feature Stores•Building Scalable & Advanced AI based Language Solutions for R&D using Databricks•Building Spatial Applications with Apache Spark and CARTO•Building a Lakehouse for Data Science at DoorDash•Building a Lakehouse on AWS for Less with AWS Graviton and Photon•Building an Operational Machine Learning Organization from Zero and Leveraging ML for Crypto Security•Building and Scaling Machine Learning-Based Products in the World's Largest Brewery•Chaos Engineering in the World of Large-Scale Complex Data Flow•Cloud Native Geospatial Analytics at JLL•Cloud and Data Science Modernization of Veterans Affairs Financial Service Center with Azure Databricks•Complete Data Security and Governance Powered by Unity Catalog and Immuta•Connecting the Dots with DataHub Lakehouse and Beyond•Constraints, Democratization, and the Modern Data Stack - Building a Data Platform At Red Ventures with Fivetran and Databricks•Coral and Transport Portable SQL and UDFs for the Interoperability of Spark and Other Engines•Correlation Over Causation Cracking the Relationship Between User Engagement and User Happiness•Cutting the Edge in Fighting Cybercrime Reverse-Engineering a Search Language to Cross-Compile it to PySpark•DELETE UPDATE MERGE Operations in Data Source V2•Data Lakehouse and Data Mesh—Two Sides of the Same Coin•Data Mesh Implementation Patterns•Data Warehousing on the Lakehouse•DataFusion and Arrow: Supercharge Your Data Analytical Tool with a Rusty Query Engine•Databricks Lakehouse Overview•Databricks SQL Under the Hood: What's New with Live Demos•Day 1 Afternoon Keynote•Day 2 Afternoon Keynote•Day 2 Opening Keynote•Deep Dive How to Build Your Modern Data Stack on Databricks to Solve Modern Problems•Deep Dive into the New Features of Apache Spark 3.2 and 3.3•Deliver Faster Decision Intelligence From Your Lakehouse•Delta Lake, the Foundation of Your Lakehouse•Delta Live Tables Modern software engineering and management for ETL•Delta Sharing - A New Paradigm for Secure Data Sharing and Data Collaboration on Lakehouse•Delta Sharing for Healthcare and Life Sciences•Democratizing Metrics at Airbnb•Designing Better MLOps Systems•Destination Lakehouse All Your Data Analytics and AI on One Platform•Distributed Machine Learning at Lyft•Dive Deeper into Data Engineering on Databricks•Doubling the Capacity of the Data Platform Without Doubling the Cost•Driving Real-Time Data Capture and Transformation in Delta Lake with Change Data Capture•Efficient and Multi-Tenant Scheduling of Big Data and AI Workloads•Eliminating AI Risk—One Model Failure at a Time•Emerging Data Architectures & Approaches for Real-Time AI using Redis•Enable Production ML with Databricks Feature Store•Enabling Advanced Analytics at The Department of State using Databricks•Enabling BI in a Lakehouse Environment How Spark and Delta Can Help With Automating a DWH Development•Enabling Business Users to Perform Interactive Ad-Hoc Analysis over Delta Lake with No Code•Ensuring Correct Distributed Writes to Delta Lake in Rust with Formal Verification•Entity Resolution•Evolution of Data Architectures and How to Build a Lakehouse•Financial Services Industry Forum: The Future of Financial Services is Open with Data and AI at Its Core•Fugue Tune Distributed Hybrid Hyperparameter Tuning•FugueSQL—The Enhanced SQL Interface for Pandas and Spark DataFrames•FutureMetrics Using Deep Learning to Create a Multivariate Time Series Forecasting Platform for Economic Strategic Planning•Gamer User Toxicity•Gazelle-Jni: A Middle Layer to Offload Spark SQL to Native Engines for Execution Acceleration•Government Industry Forum Lunch and Program•Hassle-Free Data Ingestion into the Lakehouse•Healthcare Data Interoperability•How AARP Services Inc. automated SAS transformation to Databricks using LeapLogic—A cloud accelerator for transformation of legacy analytics ETL DW & Hadoop•How AT&T Data Science Team Solved an Insurmountable Big Data Challenge on Databricks with Two Different Approaches using Photon and RAPIDS Accelerator for Apache Spark•How Databricks is driving disruptive digital transformation in the airline industry•How EPRI Uses Computer Vision to Mitigate Wildfire Risks for Electric Utilities•How McAfee Leverages Databricks on AWS at Scale•How Robinhood Built a Streaming Lakehouse to Bring Data Freshness from 24h to Less Than 15 Mins•How To Make Apache Spark on Kubernetes Run Reliably on Spot Instances•How To Use Databricks SQL for Analytics on Your Lakehouse•How to Implement a Semantic Layer for Your Lakehouse•How unsupervised machine learning can scale data quality monitoring in Databricks•Immuta - Unlocking sensitive use cases with automated data access•Implementing a Framework for Data Security and Policy at a Large Public Sector Agency•Implementing an End-to-End Demand Forecasting Solution Through Databricks and MLflow•Improving Apache Spark Structured Streaming Application Processing Time by Configurations Code Optimizations and Custom Data Source•Improving Interactive Querying Experience on Spark SQL•Introducing Zipline An Open Source Feature Engineering Platform•Introduction to Flux and OSS Replication•Lakehouse with Delta Lake Deep Dive•Laying the Foundation for Claims Automation•Learn to Efficiently Test ETL Pipelines•Lessons Learned from Deidentifying 700 Million Patient Notes•Leveraging ML-Powered Analytics for Rapid Insights and Action a demonstration•Live Analytics: The next user engagement frontier•Low-Code Machine Learning on Databricks with AutoML•ML on the Lakehouse Bringing Data and ML Together to Accelerate AI Use Cases•MLOps at DoorDash•MLflow Pipelines Accelerating MLOps from Development to Production•Managing Straggler Executors at Apache Spark 3.3•Meetup Women in Data and AI•Meshing About with Databricks•Migrate Your Existing DAGs to Databricks Workflows•Migrate and Modernize your Data Platform with Confluent and Databricks•Migrating Complex SAS Processes to Databricks - Case Study•Migrating SAS to a Lakehouse on Databricks and S3•Monitoring and Quality Assurance of Complex ML Deployments via Assertions•More Context Less Chaos How Atlan and Unity Catalog Power Column-Level Lineage and Active Metadata•Mosaic: A Framework for Geospatial Analytics at Scale•Moving from Apache Spark 2 to Apache Spark 3 Spark Version Upgrade at Scale in Pinterest•Multi-Touch Attribution•Multimodal Deep Learning Applied to E-commerce Big Data•Near Real-Time Analytics with Event Streaming Live Tables and Delta Sharing•Nixtla: Deep Learning for Time Series Forecasting•Opening the Floodgates Enabling Fast Unmediated End User Access to Trillion-Row Datasets with SQL Data Warehouses•Operational Analytics: Expanding the Reach of Data in the Lakehouse Era•Optimizing Speed and Scale of User-Facing Analytics Using Apache Kafka and Pinot•Orchestration Made Easy with Databricks Workflows•OvalEdge End-To-End Data Governance•Patient Cohort Building with NLP and Knowledge Graphs•Powering Up the Business with a Lakehouse•Practical Data Governance in a Large Scale Databricks Environment•Predicting Repeat Admissions to Substance Abuse Treatment with Machine Learning•Predicting and Preventing Machine Downtime with AI and Expert Alerts•Propensity Scoring Demo•Protecting Personally Identifiable Information (PII)/PHI Data in Data Lake via Column Level Encryption•Pushing the limits of scale and performance for enterprise-wide analytics: A fire-side chat with Akamai•PySpark in Apache Spark 3.3 and Beyond•Radical Speed on the Lakehouse Photon Under the Hood•Real Time Bidding•Real Time Retail Demo•Real World Evidence and Propensity Score Matching•Real-Time Search and Recommendation at Scale Using Embeddings and Hopsworks•Real-time Risk Management with Confluent & Databricks•Realize the Promise of Streaming with the Databricks Lakehouse Platform•Recent Parquet Improvements in Apache Spark•Regulatory Reporting: Automatically translate enterprise data models into efficient data pipelines•Retail Industry Forum•Rethinking Orchestration as Reconciliation Software-Defined Assets in Dagster•Running a Low Cost Versatile Data Management Ecosystem with Apache Spark at Core•SAS Migration•Scaling AI Workloads with the Ray Ecosystem•Scaling Deep Learning on Databricks•Scaling ML at CashApp with Tecton•Scaling Salesforce In-Memory Streaming Analytics Platform for Trillion Events Per Day•Scaling Your Workloads with Databricks Serverless•Search and Aggregations Made Easy with OpenSearch and NodeJS•Serverless Kafka and Apache Spark in a Multi-Cloud Data Lakehouse Architecture•Serving Near Real-Time Features at Scale•Simplifying Migrations to Lakehouse—the Databricks Way•Sink Framework Evolution in Apache Flink•Smart Manufacturing Real-time Process Optimization with Databricks•So Fresh and So Clean: Learn How to Build Real-Time Warehouses on Lakehouse•Sound Data Engineering in Rust—From Bits to DataFrames•Spark Inception: Exploiting the Apache Spark REPL to Build Streaming Notebooks•Stadium Analytics•Streaming Data into Delta Lake with Rust and Kafka•Streaming ML Enrichment Framework Using Advanced Delta Table Features•Supercharge your SaaS applications with a modern cloud-native database•Survey of Production ML Tech Stacks•Tackling Challenges of Distributed Deep Learning with Open Source Solutions•Take Databricks Lakehouse to the Max with Informatica•Technical and Tactical Football Analysis Through Data•The Databricks Notebook Front Door of the Lakehouse•The Future is Open - a Look at Google Cloud’s Open Data Ecosystem•The Future of Data - What’s Next with Google Cloud•The Road to a Robust Data Lake Utilizing Delta Lake and Databricks to Map 150 Million Miles of Roads a Month•Tools for Assisted Apache Spark Version Migrations From 2.1 to 3.2+•Towards Dynamic Microstructure The Role of Machine Learning in the Next Generation of Exchanges•Tredence On Shelf Availability•Turbocharge your AI/ML Databricks workflows with Precisely•Turning Fan Data Into an Asset•Unifying Data Science and Business Artificial Intelligence Augmentation and Integration into Production Business Applications•What to Do When Your Job Goes OOM in the Night Flowcharts•Why a Data Lakehouse is Critical During the Manufacturing Apocalypse•You Have BI. Now What Activate Your Data•Your fastest path to Lakehouse and beyond•dbt + Machine Learning What Makes a Great Baton Pass•dbt and Databricks: Analytics Engineering on the Lakehouse

Data + AI Summit 2022 超清视频下载相关推荐

  1. 如何使用Bandicam录制1080P超清视频上传优酷?

    如果使用Bandicam录制1080P超清视频上传优酷? 很简单,打开Bandicam!然后:"录像"设置- 如图! 购买官方正版Bandicam 请联系我们: --------- ...

  2. 超清视频时代要来了吗?

    超清视频产业链的集体气风不是巧合.虽然,该产业目前仍处于稚嫩期,但一些垂直领域的发展已相当快速.在未来,想必会有越来越多的企业成为新时代的宠儿. @数科星球 原创 作者丨科科 编辑丨十里香 2022年 ...

  3. 华人一作登ICCV 2021,实时超分新SOTA!AutoML显神威:1%参数量,超清视频70倍加速...

      视学算法报道   作者:詹政 编辑:好困 小咸鱼 [新智元导读]东北大学王言治团队将网络结构搜索与剪枝搜索相结合,提出了全新的自动搜索框架.该AutoML框架得到的稀疏模型能够在移动设备上实时且高 ...

  4. Adobe After Effects与Adobe Premiere导出超清视频且保持正常大小方法

    Adobe After Effects与Adobe Premiere导出超清视频且保持正常大小方法 Adobe After Effects导出视频默认无损AVI格式,播放异常卡顿,本文主要介绍如何设置 ...

  5. Spark+AI Summit Europe 2019 超清视频PPT下载

    为期三天的 SPARK + AI SUMMIT Europe 2019 于 2019年10月15日-17日荷兰首都阿姆斯特丹举行.数据和 AI 是需要结合的,而 Spark 能够处理海量数据的分析,将 ...

  6. MP4视频格式转换器怎样转换MKV超清视频格式

    在日常生活中,看电视剧电影都是常有的事情,但是观看高清版的视频比一般的甚至模糊的体验肯定是不一样的.其实模糊的视频是可以转为高清乃至超清的文件,只不过需要使用到视频格式转换器,那么哪款视频转换器转换过 ...

  7. 金山云发布画质增强KIE,AI赋能提升超清体验

    自从苹果开创无键盘设计时代之后,手机厂商便把人机交互中最重要的屏幕相关技术作为核心的差异化手段.除了不断优化包括图像采集相关的能力,更是在屏幕尺寸和分辨率上不断推陈出新.比如手机从厚到薄,从5英寸以内 ...

  8. 西电AI专业排名超清北,南大蝉联全国第一 | 2022软科中国大学专业排名

    金磊 发自 凹非寺 量子位 | 公众号 QbitAI 中国高校人工智能专业哪家强? 就在最近,软科中国大学专业排名新鲜出炉. 话不多说,结果直接奉上. 在此次排名中,专业评级最高的"A+&q ...

  9. 视频压缩器哪个好?最高可压缩8K超清视频的神器,就问你爱不爱?

    最近线上办公非常频繁,经常要用到视频压缩器,但是一直没有找到一款高清无损的且好用的视频压缩器.前两天在网上被安利了一款叫万兴优转的视频压缩器,用下来真的爱不释手,就迫不及待的来分享给有需要的朋友啦. ...

最新文章

  1. cocos2d-x 音效中断问题
  2. iOS的一些常用性能优化,和内存优化的方法
  3. 2013年3月空调类品牌网络知名度排名
  4. jQuery学习笔记(四)——表单选择
  5. 在计算机的通信子网中 其操作方式有两种,全国自考计算机网技术模拟试卷(八)及答案.doc...
  6. [html] input上传文件可以同时选择多张吗?怎么设置?
  7. mysql8支持myISAM_mysql8 参考手册--优化MyISAM表
  8. cas登录成功后不跳转成功页面_SpringSecurity用户登录成功后页面跳转原理剖析
  9. 【数字逻辑设计】毛刺
  10. 2013 前瞻 + 技术牛
  11. MySql Server 5.5安装教程
  12. Google Chrome 启动 0x0000005 崩溃问题的解决方法
  13. CentOS添加新硬盘和硬盘格式化
  14. html导航条背景颜色代码,div+css实现鼠标经过背景高亮的导航菜单代码
  15. PIPIOJ1166PIPI的棋盘
  16. 徒步运动软件怎么申请测试,专业人士教你如何顺利完赛50公里徒步
  17. SaaS企业如何构建与自身增长目标相匹配的市场力?
  18. 疫情期间想做兼职增加收入,那你应该看看这四个网赚项目
  19. TIA博途WINCC的触摸屏VB脚本入门学习(IF THEN ELSE判断语句)
  20. Java HashMap中在resize()时候的rehash,即再哈希法的理解

热门文章

  1. android手机改车载,到底手机导航还是车载导航好,我选择在车上加装安卓大屏...
  2. 用stm32F103核心板的GPIOA端一管脚接一个LED,GPIOB端口一引脚接一个开关(用杜邦线模拟代替)。采用中断模式编程,当开关接高电平时,LED亮灯;接低电平时,LED灭灯。
  3. Jordan标准形(番外篇)——Jordan块的最小多项式
  4. Centos7.6升级到Centos8.0
  5. html+css自适应窗口+禅意花园
  6. 小米盒子 装android,沙发管家教你如何用小米盒子安装非安卓大型游戏
  7. 无公网IP,外网远程连接MySQL数据库
  8. 腾讯会议后台研发效能提升之路
  9. 【毕业设计】基于python的旅游数据分析可视化系统 -大数据 旅游数据分析
  10. object.getClass()和Object.class的区别