Data Engineering & Data Science
Data Engineering
Data Pipelines (ETL/ELT)
Big Data Technologies
Cloud Computing for Data
Data Governance & Quality
Data Science
Machine Learning (ML)
Statistical Analysis
Data Visualization
Natural Language Processing (NLP)
521 topics in this forum
-
At Databricks, we believe the future of business intelligence is powered by AI. That’s why we’re thrilled to announce the Databricks Smart Business Insights Challenge.View the full article
-
- 0 replies
- 5 views
- 1 follower
-
-
-
Today, AWS announces the general availability of AWS Glue G.4X and G.8X workers in the US West (N. California), Asia Pacific (Seoul), Asia Pacific (Mumbai), Europe (London), Europe (Spain), and South America (São Paulo) AWS regions. Glue G.4X and G.8X workers enable you to run your most demanding serverless data integration workloads in these additional regions. AWS Glue is a serverless, scalable data integration service that makes it simple to discover, prepare, move, and integrate data from multiple sources. AWS Glue G.4X and G.8X workers provide higher compute, memory, and storage resources than current Glue workers. These new types of workers help you scale and r…
-
- 0 replies
- 55 views
-
-
Are you a startup building core, customer-facing B2B products on Databricks? Then we have a Challenge for you! On the heels of our Generative AI...View the full article
-
- 0 replies
- 60 views
-
-
With today’s launch, AWS Clean Rooms provides additional privacy-enhancing controls to support aggregation and list analysis rules using the Spark analytics engine. Using AWS Clean Rooms Spark SQL, you and your partners can now manage how your data is used with aggregation, list, and custom analysis rules, running SQL queries with configurable resources based on your performance, scale, and cost requirements. For example, advertisers can use list analysis rules to create targeted audience segments from collective advertiser and publisher data sets without sharing the raw data used to create the segments. Similarly, publishers and their partners can run media planning a…
-
- 0 replies
- 32 views
-
-
Since our launch on Google Cloud Platform (GCP) in 2021, Databricks on Google Cloud has provided more than 1,500 joint customers with a tightly integrated...View the full article
-
- 0 replies
- 56 views
-
-
We’re excited to announce the General Availability of Lakeflow Connect for Salesforce and Workday. Lakeflow Connect introduces no-code ingestion connectors for popular SaaS applications, databases,...View the full article
-
- 0 replies
- 53 views
-
-
Introduction Game developers have always looked to build ongoing relationships with its players to maximize the play they bring to the world, and the success...View the full article
-
- 0 replies
- 64 views
-
-
Understanding GraphRAG What is a Knowledge Graph? To understand why one may use a Knowledge Graph (KG) instead of another structured data representation, it’s importantView the full article
-
- 0 replies
- 39 views
-
-
As more and more organizations embrace analytics, a wider range of problems are being brought forward to be solved. While data science teams are often...View the full article
-
- 0 replies
- 58 views
-
-
We’re excited to announce the Public Preview of the Microsoft Power BI task type in Databricks Workflows, available on Azure, AWS, and GCP. With this...View the full article
-
- 0 replies
- 49 views
-
-
Within the big data and analytics space there are two names at the forefront of conversation: Apache Spark and Databricks. While they’re closely related, they serve very different purposes in the data ecosystem. Understanding their core differences is critical for architects, developers, and data engineers looking to build scalable, high-performance data solutions in the cloud. […] The article Databricks vs Apache Spark: Key Differences and When to Use Each was originally published on Build5Nines. To stay up-to-date, Subscribe to the Build5Nines Newsletter. View the full article
-
- 0 replies
- 64 views
-
-
Willis Towers Watson (WTW) is a multinational company that provides a wide range of services in commercial insurance brokerage, risk management, employee benefits, and actuarial...View the full article
-
- 0 replies
- 55 views
-
-
Qwen models, developed by Alibaba, have shown strong performance in both code completion and instruction tasks. In this blog, we’ll show how you can register...View the full article
-
- 0 replies
- 50 views
-
-
Databricks enables organizations to securely share data, AI models, and analytics across teams, partners, and platforms without duplication or vendor lock-in. With Delta Sharing, Databricks...View the full article
-
- 0 replies
- 52 views
-
-
Databricks introduced last year Databricks Apps, completing its suite of tools that allows users to create and deploy applications directly on the Databricks Platform. With...View the full article
-
- 0 replies
- 47 views
-
-
We’re excited to announce that Anthropic Claude 3.7 Sonnet is now natively available in Databricks across AWS, Azure, and GCP. For the first time, youView the full article
-
- 0 replies
- 43 views
-
-
Prisma Cloud is the leading Cloud Security platform that provides comprehensive code-to-cloud visibility into your risks and incidents, offering key remediation capabilities to manage andView the full article
-
- 0 replies
- 38 views
-
-
Large language models are challenging to adapt to new enterprise tasks. Prompting is error-prone and achieves limited quality gains, while fine-tuning requires large amounts ofView the full article
-
- 0 replies
- 38 views
-
-
Databricks Apps provide a robust platform for building and hosting interactive applications. React is great for building modern, dynamic web applications that need to updateView the full article
-
- 0 replies
- 16 views
-
-
Driving Sustainable Aluminum Production: How to Calculate the Material Recovery Ratio with GraphFrames Sustainable production has become an imperative in today’s manufacturing market. According toView the full article
-
- 0 replies
- 20 views
-
-
We’re excited to announce the General Availability of Explore in Tableau, a new integration that lets you create Tableau Cloud visualizations directly from Unity Catalog...View the full article
-
- 0 replies
- 18 views
-
-
We’re making it easier than ever for Databricks customers to run secure, scalable Apache Spark™ workloads on Unity Catalog Compute with Unity Catalog Lakeguard. In...View the full article
-
- 0 replies
- 23 views
-
-
Training AI models for real-world applications require vast amounts of labeled data, which can be costly, time-consuming, and difficult to obtain at scale. Synthetic data...View the full article
-
- 0 replies
- 18 views
-
-
We’re excited to announce the General Availability of Hive Metastore (HMS) and AWS Glue Federation in Unity Catalog! This new capability enables Unity Catalog to...View the full article
-
- 0 replies
- 26 views
-
-
Introduction In this blog, we share the journey of building a Serverless optimized Artifact Registry from the ground up. The main goals are to ensure...View the full article
-
- 0 replies
- 15 views
-
-
At Home Trust, we measure success in terms of relationships. Whether we’re working with individuals or businesses, we strive to help them stay “Ready for...View the full article
-
- 0 replies
- 15 views
-
-
Generative AI is transforming how organizations interact with their data, and batch LLM processing has quickly become one of Databricks' most popular use cases. Last...View the full article
-
- 0 replies
- 16 views
-
-
In today’s dynamic retail environment, staying connected to customer sentiments is more crucial than ever. With shoppers sharing their experiences across countless platforms, retailers are...View the full article
-
- 0 replies
- 16 views
-
-
Earlier this week, we announced new agent development capabilities on Databricks. After speaking with hundreds of customers, we've noticed two common challenges to advancing beyond...View the full article
-
- 0 replies
- 16 views
-
-
기업들이 전략적 의사 결정을 내릴 때 데이터 기반 인사이트를 적극 활용함에 따라 데이터 인텔리전스 플랫폼의 최신 트렌드는 더욱 정교하고 확장 가능하며 안전한 솔루션으로 발전하는 방향을...View the full article
-
- 0 replies
- 12 views
-
-
As part of our Week of AI agents initiative, we’re introducing new capabilities to help enterprises build and govern high-quality AI agents. To that end,...View the full article
-
- 0 replies
- 24 views
-
-
As we mentioned in our blog earlier this week, AI agents require enterprise data integration and output governance to achieve production quality. Today we're launching...View the full article
-
- 0 replies
- 17 views
-
-
While 85% of global enterprises already use Generative AI (GenAI), organizations face significant challenges scaling these projects beyond the pilot phase. Even the most advanced...View the full article
-
- 0 replies
- 13 views
-
-
Marketers have long dreamed of one-on-one customer engagement, but crafting the volume of messages required for personalized engagement at that level has been a major...View the full article
-
- 0 replies
- 14 views
-
-
We’re excited to announce the General Availability of Lakehouse Federation for Google BigQuery and the Public Preview for Oracle and Teradata. Now, you can connect,...View the full article
-
- 0 replies
- 14 views
-
-
We’re excited to announce the Public Preview of Automatic Liquid Clustering, powered by Predictive Optimization. This feature automatically applies and updates Liquid Clustering columns on...View the full article
-
- 0 replies
- 16 views
-
-
Databricks is excited to announce an expansion to our startup offer, providing game studios access to free credits, expert advice and a data and AI...View the full article
-
- 0 replies
- 17 views
-
-
Microsoft has announced the official retirement of Azure Data Studio on February 28, 2026. This decision marks a significant shift in Microsoft’s approach to database development tools, as users are encouraged to transition to Visual Studio Code (VS Code) with the appropriate extensions for database management and SQL development. What is Azure Data Studio? Azure […] The article Azure Data Studio Retires February 28, 2026 was originally published on Build5Nines. To stay up-to-date, Subscribe to the Build5Nines Newsletter. View the full article
-
- 0 replies
- 16 views
-
-
When we think of use cases like product recommendations, churn predictions, advertising attribution and fraud detection, a common denominator is they all require us to...View the full article
-
- 0 replies
- 15 views
-
-
Introduction AI/BI Dashboards and Genie are evolving at a breakneck pace. In this roundup, we’ll highlight the most impactful updates from the past three months...View the full article
-
- 0 replies
- 13 views
-
-
Building an end-to-end AI or ML platform often requires multiple technological layers for storage, analytics, business intelligence (BI) tools, and ML models in order to...View the full article
-
- 0 replies
- 15 views
-
-
Summary Databricks will be at GDC this year, demonstrating how game teams can de-risk their development and better know and grow their player base like...View the full article
-
- 0 replies
- 16 views
-
-
In today's fast-paced business environment, the ability to quickly access and analyze data is crucial for maintaining a competitive edge. As North America's largest book...View the full article
-
- 0 replies
- 12 views
-
-
Book at meeting with Databricks at MWC 2025! As we approach Mobile World Congress (MWC) 2025, the telecommunications industry is poised for a transformative leap...View the full article
-
- 0 replies
- 13 views
-
-
For utility companies such as Xcel Energy, wildfire mitigation is critical to protecting electrical infrastructure and minimizing the risk of utility-related ignition events. Typical mitigation...View the full article
-
- 0 replies
- 14 views
-
-
Introduction Stateful stream processing refers to processing a continuous stream of events in real-time while maintaining state based on the events seen so far. This...View the full article
-
- 0 replies
- 14 views
-
-
The Future of Data and AI Belongs to Open and Portable Platforms The promise of AI has never been greater. As organizations race to...View the full article
-
- 0 replies
- 12 views
-
-
Databricks SQL continues to evolve with new features and performance improvements designed to make it simpler, faster, and more cost-efficient. Built on the lakehouse architecture...View the full article
-
- 0 replies
- 13 views
-
-
Finetuning Embedding Models for Better Retrieval and RAG TL;DR: Finetuning an embedding model on in-domain data can significantly improve vector search and retrieval-augmented generation (RAG)...View the full article
-
- 0 replies
- 16 views
-