Data Engineering & Data Science

  1. In the rapidly evolving landscape of data management, data warehousing continues to be a cornerstone for businesses seeking to harness the power of... View the full article

  2. As a global media conglomerate housing over 37 distinct brands, Condé Nast faced the challenge of delivering targeted consumer experiences across their brands... View the full article

  3. At the time of writing this blogpost, I'm a mere one week away from the end of my summer internship on the Exploratory... View the full article

  4. Started by Databricks

    Within the Databricks Community, there is a technical blog where community members share best practices, tutorials and insights on data analytics, data engineering... View the full article

  5. Started by Databricks

    We're thrilled to launch our 2024 Data + AI World Tour, a series of free in-person events in cities worldwide. Each stop... View the full article

  6. When it comes to GenAI in the enterprise, excitement is colliding with reality. Leaders recognize the technology's power and eagerly want to unleash... View the full article

  7. A recent MIT Tech Review Report shows that 71% of surveyed organizations intend to build their own GenAI models. As more work to... View the full article

  8. Every business wants to be a data and AI vanguard. But to make that happen, companies must commit to a GenAI vision and... View the full article

  9. A guide for developers to create custom DSLs to address specific domain requirements, enhancing productivity and unlocking new possibilities in problem-solving. View the full article

  10. One of the most exciting parts of the Data + AI Summit is hearing about all the ways our over 10,000 global customers... View the full article

  11. Understanding CDP vs DMP: Is a customer data platform or a data management platform the best software for your business' data management future? View the full article

  12. Started by Databricks

    Introduction: Twelve Labs Embed API enables users to use natural language to explore the content of video libraries, as well as generate summaries... View the full article

  13. We're excited to announce that looping for Tasks in Databricks Workflows with For Each is now Generally Available! This new task type makes... View the full article

  14. We recently announced the general availability of serverless compute for Notebooks, Workflows, and Delta Live Tables (DLT) pipelines. Today, we'd like to explain... View the full article

  15. We're excited to announce the general availability of hybrid search in Mosaic AI Vector Search. Hybrid search is a powerful feature that combines... View the full article

  16. All the code is available in this GitHub repository. Prior to reading this blog we recommend reading Getting Started with Delta Live... View the full article

  17. Today, we're excited to announce the launch of Data Warehouse Brickbuilder Migration Solutions. This is an expansion to the Brickbuilder Program, which... View the full article

  18. Started by Databricks

    Databricks Workflows is the cornerstone of the Databricks Data Intelligence Platform, serving as the orchestration engine that powers critical data and AI workloads... View the full article

  19. Special thanks to David Gray @Epsilon, Tanishq Bhalla @HealthVerity, Itai Weiss @ Nimble, JB Kole @ Mostly.ai for their valuable insights and contributions... View the full article

  20. Special thanks to Kevin Glover, Martin Ko, Kuber Sharma and the team at Tableau for their valuable insights and contributions to this blog... View the full article

  21. During my MBA internship this summer, I worked on several data projects. My favorite project was building a "virtual analyst" for our strategy... View the full article

  22. Started by Databricks

    At Databricks, we want to make data and AI accessible to everyone on the planet. This is why we're building solutions like AI/BI... View the full article

  23. We are excited to announce the latest addition to the Databricks developer experience: the PyCharm Professional Integration with Databricks! This new plugin... View the full article

  24. Introduction: The research and engineering community at large have been continuously iterating upon Large Language Models (LLMs) in order to make them... View the full article

  25. This blog was written in collaboration with Gordon Strodel, Director, Data Strategy & Analytics Capability, in addition to Abhinav Batra, Associate Principal, Enterprise... View the full article

  26. An Introduction to Time Series Forecasting with Generative AI: Time series forecasting has been a cornerstone of enterprise resource planning for decades. Predictions... View the full article

  27. We are excited to announce that Graviton, the ARM-based CPU instance offered by AWS, is now supported on the Databricks ML Runtime... View the full article

  28. Databricks is thrilled to share that our University Alliance has welcomed its one-thousandth-member school! This milestone is a testament to our mission to... View the full article

  29. Welcome to the Generative AI World Cup 2024, a global hackathon inviting participants to develop innovative Generative AI applications that solve real-world... View the full article

  30. Today, we are thrilled to announce that Databricks SQL Serverless is now Generally Available on Google Cloud Platform (GCP)! As a key component... View the full article

  31. Started by Databricks

    Retrieval Augmented Generation (RAG) is the most widely adopted generative AI use case among our customers. RAG enhances the accuracy of LLMs by... View the full article

  32. Overview: This blog post is a follow-up to the session From Supernovas to LLMs at Data + AI Summit 2024, where I demonstrated... View the full article

  33. What is AWS Glue? AWS Glue is a serverless integration service that provides a simpler, faster, and cheaper approach to discovering, preparing, and integrating data for modern ETL (Extract, Transform & Load) pipelines. Data can be extracted from the source, transformed as required, and loaded into the data warehouse. It has a […] View the full article

  34. In today's rapidly evolving technological landscape, the intersection of data and artificial intelligence (AI) has become a critical focus for organizations across industries... View the full article

  35. Rolls-Royce has witnessed the transformative power of the Databricks Data Intelligence Platform in various AI projects. One example is a collaboration between Rolls-Royce... View the full article

  36. Fueled by the exponential growth in external data and AI for innovation, organizations across all industries are looking for effective ways to collaborate... View the full article

  37. With the increase in data size and the diversity of data sources and destinations, companies and data teams are always on the lookout for tools that can simplify creating and managing data workflows. Many of these teams target cloud services because of their simplicity, low cost, and ability to scale and process terabytes of data. […] View the full article

  38. Training a high-quality machine learning model requires careful data and feature preparation. To fully utilize raw data stored as tables in Databricks, running... View the full article

  39. At Data and AI Summit, we announced the general availability of Databricks Lakehouse Monitoring. Our unified approach to monitoring data and AI... View the full article

  40. We’re excited to announce the Public Preview of LakeFlow Connect for SQL Server, Salesforce, and Workday. These ingestion connectors enable simple and efficient... View the full article

  41. Companies across all industries want to share data with each other to enable collaboration and accelerate innovation. However, these organizations often use different... View the full article

  42. We are excited to announce a range of new integrations that will allow our customers to access and derive insights from their data... View the full article

  43. Introduction: An organization adopting new technologies or on a modernization journey typically focuses on upcoming tools, their features and potential performance/cost improvements under... View the full article

  44. Started by Databricks

    Financial Valuations & Comparative Analysis: Financial institutions specialized in capital markets such as hedge funds, market makers and pension funds have long been... View the full article

  45. The transformative potential of artificial intelligence (AI) is undeniable. From productivity efficiency, to cost savings, and improved decision-making across all industries, AI is... View the full article

  46. Introduction: Time series forecasting serves as the foundation for inventory and demand management in most enterprises. Using data from past periods along with... View the full article

  47. Today, we are excited to announce that Lakehouse Federation in Unity Catalog is now Generally Available (GA) across AWS, Azure, and GCP! Lakehouse... View the full article

  48. Databricks is thrilled to announce the General Availability (GA) of Primary Key (PK) and Foreign Key (FK) constraints, starting in Databricks Runtime 15.2... View the full article

  49. As the Data Platform team at Databricks, we leverage our own platform to provide an intuitive, composable, and comprehensive Data and AI platform... View the full article

  50. The communications industry is experiencing immense change due to rapid technological advancements and evolving market trends. Communications service providers (CSP) build various solutions... View the full article