Data Engineering & Data Science
Data Engineering
Data Pipelines (ETL/ELT)
Big Data Technologies
Cloud Computing for Data
Data Governance & Quality
Data Science
Machine Learning (ML)
Statistical Analysis
Data Visualization
Natural Language Processing (NLP)
1,048 topics in this forum
-
Databricks announced the public preview of Mosaic AI Agent Framework & Agent Evaluation alongside our Generative AI Cookbook at the Data + AI... View the full article
-
- 0 replies
- 44 views
-
-
Mixture-of-Experts (MoE) has emerged as a promising LLM architecture for efficient training and inference. MoE models like DBRX , which use multiple expert... View the full article
-
- 0 replies
- 36 views
-
-
Generative AI (GenAI) can unlock immense value. Organizations are cognizant of the potential but wary of the need to make smart choices about... View the full article
-
- 0 replies
- 44 views
-
-
Relational databases, such as MySQL, have traditionally helped enterprises manage and analyze massive volumes of data effectively. However, as scalability, real-time analytics, and seamless data integration become increasingly important, contemporary data systems like Snowflake have become strong substitutes. After experimenting with a few different approaches and learning from my failures, I’m excited to share my […]View the full article
-
- 0 replies
- 28 views
-
-
Data generated from various sources can make it challenging to integrate and leverage it to make sound, data-driven decisions efficiently. Oracle data integration, part of the broader Oracle Integration suite, offers a comprehensive set of tools and services for effective data ingestion and processing. The platform offers features for building, deploying, and managing real-time data […]View the full article
-
- 0 replies
- 40 views
-
-
Oracle data load is an essential process for organizations wanting to import and manage large volumes of data within Oracle databases. This process helps keep the Oracle cloud applications, Oracle E-Business Suite (EBS), and Oracle Autonomous database up-to-date with the latest data. You can facilitate load with various tools that offer user-friendly interfaces. These tools […]View the full article
-
- 0 replies
- 32 views
-
-
Introduction Financial institutions face a demanding environment with complex regulatory examinations and a pressing need for flexible and comprehensive risk management solutions. The... View the full article
-
- 0 replies
- 40 views
-
-
Today, we are thrilled to announce the general availability of Databricks Assistant and AI-Generated Comments on all cloud platforms . Our mission at... View the full article
-
- 0 replies
- 34 views
-
-
The recent Data + AI Summit 2024 was our biggest ever. Over 16,000 of our top customers, prospects, and partners attended in person... View the full article
-
- 0 replies
- 37 views
-
-
We’re excited to introduce a revamped Catalog Explorer to streamline your day to day interactions, now live across your Unity Catalog-enabled workspaces. The... View the full article
-
- 0 replies
- 41 views
-
-
Are you looking for a simple method to set up real-time replication for data in your Oracle database? If yes, you are in the right place. Real time replication is a typical requirement while using Oracle as a transactional system so that ETL workloads can run based on this replicated instance without putting pressure on […]View the full article
-
- 0 replies
- 26 views
-
-
Effective data management requires accurate data capture, storage, processing, and analysis. Date and time values are critical in organizing and filtering data, providing a foundation for efficient data processing. Oracle’s EXTRACT function helps you obtain specific data/time value components within the Oracle database. The function facilitates precise calculations for data/time values, which can be used […]View the full article
-
- 0 replies
- 34 views
-
-
Organizations deal with data collected from multiple sources, which increases the complexity of managing and processing it. Oracle offers a suite of tools that helps you store and manage the data, and Apache Spark enables you to handle large-scale data processing tasks. The Oracle Spark connector enables data transfer between Apache Spark and Oracle. This […]View the full article
-
- 0 replies
- 32 views
-
-
This article gives information about Snowflake master data management, which you can use to enhance your business revenue. What is Master Data Management? Master data management (MDM) uses various tools and techniques to organize and structure master data in a standardized format. It combines data management practices such as data ingestion, integration, modeling, or governance […]View the full article
-
- 0 replies
- 28 views
-
-
Your organization may store large volumes of data in a single Snowflake data warehouse. However, extracting data specific to individual departments from the data warehouse can be time-consuming, delaying your analytical and business intelligence tasks. To address this issue, consider building a data mart within Snowflake. These separate data marts allow your organization’s departments to […]View the full article
-
- 0 replies
- 35 views
-
-
You can use data warehouses or data lakes as a repository for data management and analytics tasks. Both of these solutions have their advantages and disadvantages. A data warehouse is the best if your organization works only with structured data. Data lake is a suitable choice if your work is based entirely on raw or […]View the full article
-
- 0 replies
- 26 views
-
-
This article comprehensively explains the Snowflake MAX date operations through various example use cases. What is Snowflake MAX? The window function operates on a group of related rows called windows to return one output row for each input row. The syntaxes of MAX for aggregate and window functions are as follows: Here is an example […]View the full article
-
- 0 replies
- 31 views
-
-
As your business grows, so does the complexity of your data ecosystem. In today’s data-driven world, managing and integrating this massive volume of data is critical yet challenging. You need a powerful tool and a solution to streamline your data management. Oracle developed the GoldenGate to address this data management issue. Its real-time capability, high […]View the full article
-
- 0 replies
- 28 views
-
-
Storage costs are important for any business that deals with large amounts of data daily. These costs are influenced by factors like a storage device’s speed, capacity, reliability, and security and can significantly impact business performance, efficiency, and profitability. Snowflake is a cloud data warehouse with many features to help you store and manage data […]View the full article
-
- 0 replies
- 32 views
-
-
Many organizations often ponder if Snowflake, a modern data analytics platform, is a data warehouse or database. Data warehouse and database are two key components of Snowflake’s architecture. Both play distinct but complementary roles, helping you store, process, and analyze data efficiently. The Snowflake warehouse is designed to provide you with scalable and efficient computing […]View the full article
-
- 0 replies
- 30 views
-
-
Thousands of data architects, engineers, and scientists met at Data + AI Summit in San Francisco to hear from industry luminaries like Fei... View the full article
-
- 0 replies
- 41 views
-
-
Today, in microservices architecture, a large number of applications are communicating with each other. Thus, application performance monitoring is useful for debugging a single application. However, when an application expands into multiple services, it is important to know the time taken by each service, at what stage the exception occurs, and the system’s overall health. […]View the full article
-
- 0 replies
- 34 views
-
-
Manually Tracking Sales-based Leads and collecting data from Customer Interactions, Social Media, Emails, etc. can be a cumbersome task, especially when your customer base is growing at an exponential rate. This can be streamlined by an autonomous tool like Salesforce. Salesforce is a Customer Relationship Management(CRM) software company based out of San Francisco. Salesforce provides […]View the full article
-
- 0 replies
- 33 views
-
-
Organizations face a discernible lag in performance with the ever-increasing rise in data. Traditional data warehouses become a financial burden with time despite proper planning as companies also suffer storage limitations. However, Amazon rolled out Redshift, providing a cloud-based data warehouse solution that not only addresses data storage and processing issues but also integrates with […]View the full article
-
- 0 replies
- 31 views
-
-
With most companies adopting cloud as their primary choice of storing data, the need for having a powerful and robust cloud data warehouse is on the rise. One of the most popular cloud-based data warehouse that meets all these requirements is Google’s BigQuery data warehouse. It allows users to store potentially TBs of data with […]View the full article
-
- 0 replies
- 38 views
-
-
Debezium is the database monitoring platform that continuously captures and streams all real-time modifications updated on the respective database systems like MySQL and PostgreSQL. Usually, developers use CLI tools like the default command prompt terminal to work with Debezium, which is the traditional way of setting up the Debezium workspace. To begin working with Debezium, […]View the full article
-
- 0 replies
- 40 views
-
-
Salesforce is a subscription-based customer relationship management software that is offered as a completely managed cloud service. Salesforce revolutionized the CRM space by sparing customers the effort of developing custom software or maintaining installations of third-party software. In this blog post, we will discuss how to create Custom Salesforce Reports. Prerequisites Introduction to Salesforce Salesforce […]View the full article
-
- 0 replies
- 30 views
-
-
Today, a combination of Debezium and Kafka is embraced by organizations to record changes in databases and provide information to subscribers (other applications). In this article, you will learn about Kafka Debezium, features of Debezium, and how to perform event sourcing using Debezium and Kafka. Prerequisites What is Kafka? Initially developed by LinkedIn, Kafka is […]View the full article
-
- 0 replies
- 39 views
-
-
Radiology is an important component of diagnosing and treating disease through medical imaging procedures such as X-rays, computed tomography (CT), magnetic resonance imaging... View the full article
-
- 0 replies
- 41 views
-
-
Enhancing DLT development experience is a core focus because it directly impacts the efficiency and satisfaction of developers building data pipelines with DLT... View the full article
-
- 0 replies
- 36 views
-
-
In the insurance sector, customers demand personalized, fast, and efficient service that addresses their needs. Meanwhile, insurance agents must access a large amount... View the full article
-
- 0 replies
- 45 views
-
-
We are excited to announce that Gartner has recognized Databricks as a Leader in the 2024 Gartner® Magic Quadrant™ for Data Science and... View the full article
-
- 0 replies
- 44 views
-
-
Data analytics helps to derive valuable insights from your raw data. It helps you align your business processes for better outcomes by identifying trends and patterns in the data that would otherwise be lost. As your business accumulates large amounts of data, the challenge lies in implementing an efficient data analytics process that can help […]View the full article
-
- 0 replies
- 55 views
-
-
Different systems and databases use various date formats. Converting date data into a consistent format will ensure accuracy across systems. For instance, you are collecting sales data from other regions that use different formats. Combining and analyzing the sales data would be time-consuming and error-prone if the data is not standardized. By converting all the […]View the full article
-
- 0 replies
- 45 views
-
-
Azure Data Factory (ADF) is a Microsoft-managed data integration solution that facilitates the creation of cloud-based data workflows. It is a fully managed service that can be used to build data pipelines by orchestrating data movement. Snowflake is a fully managed SaaS (Software-as-a-Service) tool that offers cloud-based data warehouse services. It provides multi-cloud support and […]View the full article
-
- 0 replies
- 49 views
-
-
Organizations often struggle with data silos and inconsistencies due to customer data being dispersed across multiple systems. Such scattered data can hinder the ability to make informed, data-driven decisions. Platforms like Salesforce and Snowflake help address these challenges by unifying customer data and robust analytics. A Snowflake Salesforce integration offers real-time access to data for […]View the full article
-
- 0 replies
- 46 views
-
-
Schema management is crucial for ensuring data quality and consistency in a database. One prominent feature it enables is version control and change management. Version control helps maintain the history of schema versions, allowing an efficient way to track the changes made to the schema. To achieve this, you can use Schemachange, an open-source change […]View the full article
-
- 0 replies
- 43 views
-
-
How do you visualize your Snowflake data? Snowsight, the visual interface of Snowflake, allows two different easy ways to visualize your data within Snowflake- by using charts or dashboards. If you have large data volume and all the data from different sources are centralized to Snowflake, both of these methods will be very useful to […]View the full article
-
- 0 replies
- 42 views
-
-
Your organization might store sensitive data such as identification numbers, date of birth, or account numbers in Snowflake data warehouse tables. To ensure this information is accessible only to authorized people with appropriate roles, Snowflake supports column-level security through dynamic data masking policies. With the Snowflake data masking feature, sensitive columns within tables or views […]View the full article
-
- 0 replies
- 40 views
-
-
The constant increase in the data produced by modern technologies has given rise to significant challenges, such as data complexity, inconsistencies, and breaching issues. You need a structured approach to address these challenges and mitigate the risk of comprising sensitive data. What is Data Governance in Snowflake? While Snowflake enables you to handle increasing volumes […]View the full article
-
- 0 replies
- 41 views
-
-
Snowflake is a cloud-based platform that manages large data workloads in virtual warehouses. It is known for its unique architecture and pricing model. Snowflake charges you for the compute resources, storage, and data transfer services you utilize. However, most of the costs on your Snowflake bill are based on the compute resources you use rather […]View the full article
-
- 0 replies
- 50 views
-
-
Snowflake is a cloud data warehousing solution that has become popular for companies with large data volumes. However, moving databases from an existing data platform to Snowflake can be complicated. You may face challenges in adapting existing pipelines that require custom code or integrating data from legacy systems to Snowflake’s environment. This article will provide […]View the full article
-
- 0 replies
- 51 views
-
-
Today, we are excited to announce Databricks LakeFlow, a new solution that contains everything you need to build and operate production data pipelines... View the full article
-
- 0 replies
- 46 views
-
-
Discover how RudderStack reduced Customer Success response times by 50% using LLMs integrated with kapa.ai and Thena View the full article
-
- 0 replies
- 45 views
-
-
At Databricks, our mission is to democratize data + AI. An open approach to sharing and collaboration is critical to maximize reach and... View the full article
-
- 0 replies
- 46 views
-
-
We are excited to announce that we are open sourcing Unity Catalog, the industry’s first open source catalog for data and AI governance... View the full article
-
- 0 replies
- 40 views
-
-
In an era marked by rapid advancements in artificial intelligence and an explosion of data and Gen AI tools, enterprises face fragmented data... View the full article
-
- 0 replies
- 46 views
-
-
The annual Data Team Awards showcase how different data teams from across the globe are delivering solutions to some of the world’s most... View the full article
-
- 0 replies
- 53 views
-
-
“FactSet’s mission is to empower clients to make data-driven decisions and supercharge their workflows and productivity. To deliver AI-driven solutions across our entire... View the full article
-
- 0 replies
- 47 views
-
-
Today, we are excited to announce Databricks AI/BI , a new type of business intelligence product built from the ground up to deeply... View the full article
-
- 0 replies
- 48 views
-