Jump to content

Data Engineering & Data Science

Data Engineering

  • Data Pipelines (ETL/ELT)

  • Big Data Technologies

  • Cloud Computing for Data

  • Data Governance & Quality

Data Science

  • Machine Learning (ML)

  • Statistical Analysis

  • Data Visualization

  • Natural Language Processing (NLP)

  1. The world is currently data-driven, and most businesses and organizations extract valuable insights from their data to gain a competitive advantage. This is where ETL (Extract, Transform, and Load) and SQL (Structured Query Language processes come into play. In this write-up, you will explore the relationship between ETL and SQL, analyze how SQL is used […]View the full article

    • 0 replies
    • 17 views
  2. Started by Databricks,

    Databricks was built as an open and unified platform to handle huge data workloads at a fraction of the cost of other solutions... View the full article

    • 0 replies
    • 17 views
  3. At Zafin , our mission is to help banks modernize their core infrastructure to deliver exceptional, personalized experiences to their customers. To determine... View the full article

    • 0 replies
    • 12 views
  4. In our previous blog , we explored the methodology recommended by our Professional Services teams for executing complex data warehouse migrations to Databricks... View the full article

    • 0 replies
    • 15 views
  5. This blog describes the new change feed and snapshot capabilities in Apache Spark™ Structured Streaming’s State Reader API. The State Reader API enables... View the full article

    • 0 replies
    • 17 views
  6. The average organization generates 2.5 quintillion bytes1 of data daily. Businesses globally prioritize data management due to its exponential growth. How can organizations extract, convert, and load (ETL) meaningful and useable data with so much to process? A robust architecture is crucial to modern data management. This blog will explain how they can assist your […]View the full article

    • 0 replies
    • 16 views
  7. The term ‘lineage’ mainly creates a genealogy or family background or the manner in which people are related across the generations. Data lineage is no different in concept. It gives a chronological account of the extended family of your data, from where it originated to the intermediate transformations it undergoes and where it ends up. […]View the full article

    • 0 replies
    • 17 views
  8. In today’s fast-paced digital landscape, businesses face the daunting challenge of extracting valuable insights from large amounts of data. The ETL (Extract, Transform, Load) pipeline is the backbone of data processing and analysis. Whether you are a seasoned data engineer or a beginner in this data-driven adventure, this blog will help you build a powerful […]View the full article

    • 0 replies
    • 15 views
  9. In today's fast-paced world, utility companies face numerous challenges when it comes to outage response and restoration, especially during severe weather events. The... View the full article

    • 0 replies
    • 11 views
  10. Databricks has been recognized as one of the winners of the annual Glassdoor Employees’ Choice Awards, a list of the Best Places to... View the full article

    • 0 replies
    • 15 views
  11. Electronic products are evolving at lightning speed, driven by an insatiable demand for new consumer devices, energy, transport, robotics, connectivity, data and beyond... View the full article

    • 0 replies
    • 13 views
  12. SELECT 'Hello world!' COLLATE UNICODE, 'Zdravo svete!' COLLATE SR, 'Γειά σου, Κόσμε!' COLLATE EL, 'Здравствуй, мир!' COLLATE RU, '你好, 世界!' COLLATE ZH, 'Bonjour... View the full article

    • 0 replies
    • 15 views
  13. Data movement is essential for synchronizing and managing data for business intelligence and decision-making. Unlock your data's value today.View the full article

    • 0 replies
    • 0 views
  14. We are excited to announce that egress control for Databricks serverless and Mosaic AI Model Serving workloads is available in Public Preview on... View the full article

    • 0 replies
    • 17 views
  15. Aon plc is a leading global firm providing risk, reinsurance, retirement, and health solutions. Focusing on data-driven insights, Aon operates in over 120... View the full article

    • 0 replies
    • 12 views
  16. At Databricks, our automation vision is to automate all aspects of the business, making it better, faster, and cheaper. For the sales teams... View the full article

    • 0 replies
    • 16 views
  17. Introduction MLOps is an ongoing journey, not a once-and-done project. It involves a set of practices and organizational behaviors, not just individual tools... View the full article

    • 0 replies
    • 15 views
  18. Every organization is challenged with correctly prioritizing new vulnerabilities that affect a large set of third-party libraries used within their organization. The sheer... View the full article

    • 0 replies
    • 12 views
  19. Javier Lagares is a Principal Data Engineer at HP, where he leads the development of data-driven solutions for the 3D printing business. With... View the full article

    • 0 replies
    • 15 views
  20. AI remains at the forefront of every business leader’s plans for 2025. Overall, 70% of businesses continue to believe AI is critical to... View the full article

  21. We are excited to announce that Gartner has recognized Databricks as a Leader for a fourth consecutive year in the 2024 Gartner® Magic... View the full article

  22. We're excited to announce the Public Preview of credential vending for Unity Catalog’s open APIs, allowing external clients to securely access Unity Catalog... View the full article

  23. Started by Databricks,

    Since its launch in 2023, Databricks Assistant has grown to hundreds of thousands of monthly users, including developers at major enterprises like Rivian... View the full article

  24. Introduction Databricks has joined forces with the Virtue Foundation through Databricks for Good, a grassroots initiative providing pro bono professional services to drive... View the full article

  25. Staying competitive in Major League Soccer (MLS) demands building and maintaining a strong squad through strategic roster planning and smart, effective navigation of... View the full article

  26. Czech savings bank Česká spořitelna , a division of Austria’s Erste Group , recently collaborated with AI solution builder DataSentics to explore the... View the full article

  27. Started by Databricks,

    Large language models are improving rapidly; to date, this improvement has largely been measured via academic benchmarks. These benchmarks, such as MMLU and... View the full article

  28. We’re excited to announce the Public Preview of Query Git integration as part of the new SQL Editor . Git support for queries... View the full article

  29. We’re excited to announce a joint effort between Databricks for Games and GameAnalytics. This blog and associated code will help our mutual customers... View the full article

  30. Book at meeting wtih Databricks at NRF 2025! As we approach January 2025, the retail industry is gearing up for another groundbreaking Retail's... View the full article

  31. Seven West Media’s 7plus is one of Australia’s leading streaming platforms for broadcast VOD (video on demand), enabling audiences to livestream broadcast content... View the full article

  32. While nearly 80% of the world’s data is in video format, enabling search and understanding on video data has historically been a challenging... View the full article

  33. We just followed the documentation online, and within a few hours, we were operational and started running a job. We never had any... View the full article

  34. As enterprises build agent systems to deliver high quality AI apps, we continue to deliver optimizations to deliver best overall cost-efficiency for our... View the full article

  35. In this first part of a two-part blog series, we demonstrate how generative AI coupled with customer data can help marketing teams generate... View the full article

  36. We’re excited to announce the Public Preview of Hive Metastore (HMS) and AWS Glue Federation in Unity Catalog! This new capability enables Unity... View the full article

  37. What makes a great partnership? For Databricks and AWS, it’s not just about building together—it’s about helping businesses succeed together. At AWS re:Invent... View the full article

  38. We are pleased to announce the winners of the Databricks Generative AI Startup Challenge , a competition held in collaboration with AWS to... View the full article

  39. Introduction Building production-grade, scalable, and fault tolerant Generative AI solutions requires having reliable LLM availability. Your LLM endpoints must be ready to meet... View the full article

  40. Inspiration Going on vacation is an enjoyable experience, but planning the trip can take time and effort for most people. There are numerous... View the full article

  41. Data engineering teams are frequently tasked with building bespoke ingestion solutions for myriad custom, proprietary, or industry-specific data sources. Many teams find that... View the full article

  42. * Explore how startups using Databricks achieve higher revenue and innovation. * Learn about the Databricks Unicorn Index and its insights. * Discover real-world success stories from unicorns and emerging unicorns powered by the Databricks Data Intelligence Platform. View the full article

  43. In today’s rapidly evolving technology landscape, generative artificial intelligence (GenAI) is revolutionizing the way organizations work and is opening up new worlds of... View the full article

  44. Iceberg maintains consistency and atomicity of metadata files. Learn how to connect Unity Catalog's Iceberg REST APIs to Snowflake to read a single source data file as Iceberg. View the full article

  45. Started by Databricks,

    Databricks is proud to be a platinum sponsor of NeurIPS 2024. The conference runs from December 10 to 15 in Vancouver, British Columbia... View the full article

  46. Our customers continue to shift from monolithic prompts with general-purpose models to specialized agent systems to achieve the quality needed to drive ROI... View the full article

  47. Started by Databricks,

    Equiniti wanted to centralize data and insights to its operations. To this end, it utilized the Databricks Data Intelligence Platform and Mosaic AI tools to enhance customer experience and drive innovation. View the full article

  48. At Databricks, AutoML is our low-code/no-code model training API that empowers customers to create quality machine learning (ML) models with their data on... View the full article

  49. Started by Databricks,

    In recent years, artificial intelligence has transformed from an aspirational technology to a driver of manufacturing innovation and efficiency. Understanding both the current... View the full article

  50. Databricks launches two new self-paced trainings to enhance SQL and AI-powered analytics skills The "Get Started with SQL analytics and BI" course covers how to use Databricks SQL for data analysis and Databricks AI/BI Dashboards and Genie spaces Additional courses being developed include "Databricks AI/BI for self-service analytics" and a deep dive for data analysts on building AI/BI Dashboards and Genie Spaces View the full article