Data Engineering & Data Science
Data Engineering
Data Pipelines (ETL/ELT)
Big Data Technologies
Cloud Computing for Data
Data Governance & Quality
Data Science
Machine Learning (ML)
Statistical Analysis
Data Visualization
Natural Language Processing (NLP)
1,018 topics in this forum
-
Let’s talk about data engineers’ nightmare. Continue reading on Towards Data Science » View the full article
-
- 0 replies
- 9.6k views
-
-
Readers Digest to Learn Data Engineering Gradually. Continue reading on Towards Data Science » View the full article
-
- 0 replies
- 8.7k views
-
-
My personal take on justifying the existence of Data Mesh. A senior stakeholder at one of my projects mentioned that they wanted to decentralise their data platform architecture and democratise data across the organisation. When I heard the words ‘decentralised data architecture’, I was left utterly confused at first! In my then-limited experience as a Data Engineer, I had only come across centralised data architectures and they seemed to be working very well. So, I was left wondering what it was that we wanted to solve using a decentralised data architecture. Or were we creating a new problem that had never existed in the first place? .. A Prequel to Data Mesh was originally…
-
- 0 replies
- 6.4k views
-
-
In the dynamic realm of AI-driven forecasting, businesses navigate a landscape where strategic choices shape their trajectory. One such pivotal decision was made... View the full article
-
- 0 replies
- 6k views
-
-
This post is part of a series. Check out Part 1: The Data + AI Trifecta: People, Process, and Platform. In the current... View the full article
-
- 0 replies
- 6k views
-
-
The post covers 10 real use cases of Solr. Apache Solr is an open-source, highly scalable search platform built on Apache Lucene, a high-performance, full-text search engine library. It provides powerful capabilities for searching, indexing, and faceting large amounts of data, and it is widely used for enterprise search and analytics because it offers robust full-text search, hit highlighting, faceted search, dynamic clustering, database integration, and rich document (such as Word and PDF) handling. It is designed to handle large volumes of text-centric data and provides distribute…
-
- 0 replies
- 4.9k views
-
-
RudderStack explains how churn prediction can be done using Google’s BigQueryML together with the clickstream data gathered and delivered using the stage. View the full article
-
- 0 replies
- 3.9k views
-
-
As a data scientist or machine learning engineer, you’re constantly challenged with building accurate models and then deploying and scaling them effectively. The demand for AI-driven solutions is skyrocketing, and mastering the art of scaling machine learning (ML) applications has become more critical than ever. This is where Kubernetes, often abbreviated as K8s, emerges as a game-changer. In this blog, we’ll see how you can leverage Kubernetes to scale machine learning applications... The post Tensorflow or PyTorch + K8s = ML apps at scale appeared first on Amazic. View the full article
-
- 0 replies
- 3.2k views
-
-
"AFROTECH was not only insightful, but also greatly heightened my sense of belonging in the tech space! It was amazing to both make... View the full article
-
- 0 replies
- 2.8k views
-
-
Introduction: On January 4th, a new era in digital marketing began as Google initiated the gradual removal of third-party cookies, marking a seismic... View the full article
-
- 0 replies
- 2.5k views
-
-
A headless customer data platform architecture is the connective tissue between customer data, the cloud data warehouse, and the rest of the stack. View the full article
-
- 0 replies
- 2.4k views
-
-
The observation that "software is eating the world" has shaped the modern tech industry. Today, software is ubiquitous in our lives, from the... View the full article
-
- 0 replies
- 2.4k views
-
-
RudderStack Profiles enables teams to build scalable, transparent identity graphs in their warehouse without managing complex SQL. View the full article
-
- 0 replies
- 2.2k views
-
-
How to build a data stack to 1) centralize every data point in a single storage/compute layer, and 2) make this layer accessible to every downstream tool. View the full article
-
- 0 replies
- 2.2k views
-
-
A summary of our Data Quality Toolkit, a set of features to help you guarantee customer data quality from the source. View the full article
-
- 0 replies
- 2.1k views
-
-
Read RudderStack CEO Soumyadeb Mitra's insights on the changes ahead in the field of data as the data engineering megatrend impacts every industry. View the full article
-
- 0 replies
- 2k views
-
-
This blog was written in collaboration with Tim Sedlak, Senior Solutions Architect at Stardog. In healthcare and life sciences, accuracy is everything. That's... View the full article
-
- 0 replies
- 1.8k views
-
-
Access all of Datacamp's 460+ data and AI courses, career tracks & certifications ... https://www.datacamp.com/freeweek
-
- 0 replies
- 1.7k views
-
-
We are excited to share new identity and access management features to help simplify the set-up and scale of Databricks for admins. Unity... View the full article
-
- 0 replies
- 1.4k views
-
-
Special thanks to Barb MacLean, SVP, Head of Technology Operations and Implementation at Coastal Community Bank (Coastal) and Rob Cavallo, President at Cavallo... View the full article
-
- 0 replies
- 1.3k views
-
-
Amazon Redshift is a leading fully managed data warehouse with a serverless option, and many organizations are migrating their legacy data to Redshift for better analytics. In this blog, we will discuss the best Redshift ETL tools that you can use to load data into Redshift. 8 Best Redshift ETL Tools: Let’s have a detailed […] View the full article
-
- 0 replies
- 1.2k views
-
-
The GGUF file format is a binary file format used for storing and loading model weights for the GGML library. The library documentation... View the full article
-
- 0 replies
- 1.1k views
-
-
In 2021, we wrote about trends we saw emerging in data engineering and made a few predictions. Here, we revisit those predictions and make a few for 2022. View the full article
-
- 0 replies
- 835 views
-
-
We are excited to announce the upcoming general availability of Azure Private Link support for Databricks SQL (DBSQL) Serverless, planned in April 2024... View the full article
-
- 0 replies
- 802 views
-
-
In this definitive guide, we aim to streamline your understanding of API integration, covering all aspects from different types of APIs. View the full article
-
- 0 replies
- 780 views
-
-
With Braze Deduplication, you can automatically prevent sending duplicate data to Braze to avoid overages. View the full article
-
- 0 replies
- 766 views
-
-
This article explains the concept of regularization and its significance in machine learning and deep learning. We have discussed how regularization can be used to enhance the performance of linear models, as well as how it can be applied to improve the performance of deep learning models. View the full article
-
- 0 replies
- 763 views
-
-
An effective campaign can help improve a company's revenue by increasing the sales of its products, clearing out more stock, bringing in more... View the full article
-
- 0 replies
- 712 views
-
-
RudderStack acquires Blendo: Kostas Pardalis, the founder of Blendo, talks about why he decided to merge Blendo with RudderStack. View the full article
-
- 0 replies
- 708 views
-
-
Sync data from Trino to business tools with our integration. Warehouse-based diffing makes it the most performant Trino Reverse ETL solution on the market. View the full article
-
- 0 replies
- 701 views
-
-
RudderStack Transformations allow you to transform data in-flight with custom JavaScript so you can customize integrations, fix bad data, and enrich events. View the full article
-
- 0 replies
- 619 views
-
-
RudderStack is now more secure than ever before. Here's how we received our SOC 2 Type 1 certification. Click for more. View the full article
-
- 0 replies
- 589 views
-
-
RudderStack Predictions makes it easy for you to build predictive features in Snowflake without additional MLOps and infrastructure. View the full article
-
- 0 replies
- 553 views
-
-
Governance ensures data and AI products are consistently developed and maintained, adhering to precise guidelines and standards. It's the blueprint for architects, bringing... View the full article
-
- 0 replies
- 529 views
-
-
User segmentation is a versatile strategy that can be applied across various industries and business functions. See our definition of what user segments are. View the full article
-
- 0 replies
- 508 views
-
-
As Chief Scientist (Neural Networks) at Databricks, I lead our research team toward the goal of giving everyone the ability to build and... View the full article
-
- 0 replies
- 467 views
-
-
Learn how to architect a data stack that unlocks real-time personalization use cases, enabling you to customize user experiences in-session. View the full article
-
- 0 replies
- 465 views
-
-
Now in preview, AWS Glue Elastic Views is a new capability of AWS Glue that makes it easy to build materialized views that combine and replicate data across multiple data stores without you having to write custom code. With AWS Glue Elastic Views, you can use familiar Structured Query Language (SQL) to quickly create a virtual table—a materialized view—from multiple different source data stores. AWS Glue Elastic Views copies data from each source data store and creates a replica in a target data store. AWS Glue Elastic Views continuously monitors for changes to data in your source data stores, and provides updates to the materialized views in your target data stores autom…
-
Track sessions on web and mobile with RudderStack to get complete control over how session data is tracked. View the full article
-
- 0 replies
- 424 views
-
-
Organizations use ETL (Extract, Transform, and Load) to obtain quality data for expediting decision-making. But the sheer number of available ETL tools makes it challenging for organizations to evaluate and adopt the right one. Today, ETL tools are divided into various types, making it even more difficult for companies to find the right fit. In this […] View the full article
-
- 0 replies
- 408 views
-
-
Building the data foundation for AI starts with data quality, completeness and accessibility. Learn how you can achieve AI-readiness with RudderStack. View the full article
-
- 0 replies
- 399 views
-
-
This post is the second part of our two-part series on the latest performance improvements of stateful pipelines. The first part of this... View the full article
-
- 0 replies
- 396 views
-
-
Large language models (LLMs) have generated interest in effective human-AI interaction through optimizing prompting techniques. “Prompt engineering” is a growing methodology for tailoring... View the full article
-
- 0 replies
- 369 views
-
-
This article represents a collaborative effort between Plotly, Ballard Power Systems, and Databricks. Fleets of buses worldwide run on hydrogen fuel cells made... View the full article
-
- 0 replies
- 368 views
-
-
How Customer 360 is applied across various industries and how every business team can utilize a customer 360 to drive better business outcomes. View the full article
-
- 0 replies
- 358 views
-
-
Introduction: Four months ago, we shared how AMD had emerged as a capable platform for generative AI and demonstrated how to easily and... View the full article
-
- 0 replies
- 337 views
-
-
Reverse ETL Audiences empowers business teams to leverage warehouse data through a no-code audience builder. View the full article
-
- 0 replies
- 295 views
-
-
What is a data dictionary? Learn about the data dictionary to improve your data governance and quality. View the full article
-
- 0 replies
- 245 views
-
-
Six simple steps to send your event data with RudderStack from any source to any destination to gain business insights using customer data analytics. View the full article
-
- 0 replies
- 212 views
-
-
Get an overview of our data stack with details on how we use our own tool to move data throughout the stack. In this post we focus on marketing use cases. View the full article
-
- 0 replies
- 201 views
-