Postgres Conference Program
presented by Marshall Presser, Andreas Scherbaum, and Craig Sylvester
It's more than just storing and retrieving data. Equally important are loading high volume data in parallel and running analytics in the database. This hands-on session will lead you through the entire process of creating, loading, and analyzing data in the Greenplum MPP database. It's PostgreSQL, but bigger and DWH-focused.
At the end of this workshop, attendees will learn modern DWH techni...
more 09:00 - 16:00 Sugar Hill Greenplum SummitPostgreSQL Tutorial
presented by Avinash Vallarapu
Abstract
- This tutorial is focused on teaching PostgreSQL to professionals that are already experienced with Oracle or MySQL. As we discuss from basics, even a beginner may adapt the concepts easily. Relational databases share a lot of similar concepts. You may find it easier if we help you relate the concepts of PostgreSQL with Oracle/MySQL. Thus, we would start with PostgreSQL A...
presented by Kevin Jernigan
On AWS, you can manage PostgreSQL yourself on EC2 or you can choose one of the managed options such as RDS PostgreSQL or Amazon Aurora with PostgreSQL compatibility. In this session, you will learn about each of the options so you can choose the best PostgreSQL deployment option for your use case.
09:00 - 09:50 Grammercy AWS Migration DayTuning PostgreSQL isn't as hard as you think
presented by Malcolm McLean
Presented as full-day training
Out of the box PostgreSQL comes with very conservative performance settings. In this workshop, we'll look at many of the options that relate to performance and how to set them for your environment, as well as query optimization, statistics, operating system parameters and server resources.
There is no one-size-fits-all approach, but through this workshop, ...
more 09:00 - 16:00 Bowery Ops and AdministrationHands On Labs
presented by Alvaro Hernandez
Stop paying for Larry’s boat!
PostgreSQL is a fully-featured, enterprise-grade and open source database, named two years in a row database of the year! And it’s also the best candidate to migrate off of Oracle, supporting very advanced SQL, easy administration and pl/pgsql, a replacement for Oracle’s PL/SQL. Stop spending hundreds of thousands or millions of $ on database licenses and rein...
more 09:00 - 16:00 Murray Hill Ops and Administrationpresented by Jim Mlodgenski
Amazon Aurora with PostgreSQL compatibility is a relational database service that combines the speed and availability of high-end commercial databases with the simplicity and cost-effectiveness of open-source databases. We review the functionality in order to understand the architectural differences that contribute to improved scalability, availability, and durability. We also dive deep into th...
more 10:00 - 10:50 Grammercy AWS Migration Daypresented by James Finnerty
Amazon Aurora offers several options for monitoring and optimizing PostgreSQL database performance. These include Enhanced Monitoring and Performance Insights, an easy-to-use tool for assessing the load on your database and identifying slow-performing queries. With this information, you can use Query Plan Management to improve your database response. In this session, learn how to tune the perfo...
more 11:00 - 11:50 Grammercy AWS Migration Daypresented by PostgresConf Organizers
12:00 - 13:00 Lower Level Foyer
presented by David Rader
When enterprises move to PostgreSQL from commercial databases such as Oracle, SQL Server and DB2, there are many processes and procedures that need to be migrated along with the applications. In this talk we will discuss the overall migration process and how technical changes from migrations affect long-standing aspects of an enterprise.
13:00 - 13:50 Grammercy AWS Migration Daypresented by Craig Silveira
As any handy man knows, having the right tool in your toolbox can make or break a challenging job. The same can be said of database migrations. Having access to a good set of tools to help you at each step of the migration process is critical. In this talk, you will learn about some of those tools offered by AWS. The tools to be discussed are : Workload Qualification Framework(WQF), Schema ...
more 14:00 - 14:50 Grammercy AWS Migration DayA migration story of hundreds of databases
presented by Brent Bigonger
In this session, learn from the team that migrated Amazon’s inventory and fulfillment management systems (AFT) from Oracle to Amazon Aurora with PostgreSQL compatibility. We focus on the performance and scalability benefits to enterprises that migrate critical systems from Oracle to AWS services, the decision frameworks used to pick Amazon Aurora with PostgreSQL compatibility and best practices...
more 15:30 - 16:20 Grammercy AWS Migration Daypresented by Kevin Jernigan
Amazon Aurora is a PostgreSQL-compatible relational database with the speed, reliability, and availability of commercial databases at one-tenth the cost. Bring your questions about any Aurora topic or PostgreSQL at AWS.
16:30 - 17:20 Grammercy AWS Migration Daypresented by Eric Sheetz
High level introduction to Google Cloud Platform and it’s storage portfolio.
09:00 - 09:50 Bowery Google Cloud Summitpresented by Scott Mead
In this tutorial we will be combining an extended deep dive on Amazon Relational Database Service along with a hands on lab using Amazon RDS for PostgreSQL. As part of the deep dive we will cover the unique features of Amazon RDS for improved availability and durability. In addition, you will get hands on experience creating an Amazon RDS for PostgreSQL for high availability and read scaling as...
more 09:00 - 12:00 Chelsea Ops and Administrationpresented by Keaton Adams
Welcome to Greenplum Summit 2019! We are excited to come together once again to share insights and updates on the latest advances to the world's leading fully-featured, multi-cloud, open-source, Postgres-based, Massively Parallel Advanced Analytical database. In the presentations and technical deep-dives that are in the lineup for this year's Summit, you will discover just how far Greenplum has...
more 09:00 - 09:20 Grammercy Greenplum SummitFeatures, Tools and Usage
presented by Simon Riggs
Covers the essential tools and utilities required by Financial Industry professionals. * High Availability (with and without BDR) * Online upgrades * Auditing & Certification * PCI-Compliant Security * Client tools and Monitoring * Performance tooling Covers both core PostgreSQL and 2ndQuadrant products and services.
09:00 - 12:00 Sugar Hill Ops and AdministrationWhat PostGIS can do for me
presented by Leo Hsu
This training will teach you how to use PostGIS related extensions with main focus on the postgis, pgrouting and ogr_fdw spatial extensions. We will also cover techniques for getting the most performance out of your spatial queries. Though most queries will also work on lower versions, we will be using PostgreSQL 11+, PostGIS 2.5+ , and pgRouting 2.6+. We will highlight functions/spatial tricks...
more 09:00 - 09:50 Sutton Place Developmentpresented by Diana Hsieh
TimescaleDB is an open-source time-series database, implemented as a Postgres extension, that improves insert rates by 20x over vanilla Postgres and offers much faster queries, while natively supporting full SQL (including JOINs). TimescaleDB achieves this by storing data on an individual server in a manner more common to distributed systems: heavily partitioning the data, stored in hypertables...
more 09:00 - 12:00 Murray Hill Ops and Administrationpresented by Ivan Novick
Greenplum Database is at the forefront of global R&D for large-scale big data and analytics use cases. In this session, we will outline the new capabilities and power in Greenplum Database Version 6, as well as summarize the ongoing engineering work in progress including Postgres merging, analytics in a post-Hadoop world, GPU acceleration, high concurrency mixed workloads, Apache Kafka integrat...
more 09:30 - 09:50 Grammercy Greenplum Summitpresented by Kurtis Van Gent
Hands on demonstration building and deploying a highly available, fault tolerant web application. In this tutorial, we will demonstrate how to set up a connection from an python application running in Google Kubernetes Engine to a Cloud SQL instance, using private IP or the Cloud SQL Proxy Docker image.
10:00 - 12:00 Bowery Google Cloud Summitpresented by Frank McQuillan and Sridhar Paladugu
Advanced analytics and machine learning are rapidly growing in importance in enterprise computing. Key enterprise data typically resides in relational form, and it is inefficient to copy data between systems to perform analytical operations.
In addition to leveraging the rich set of Postgres analytics like window functions, Greenplum offers machine learning, graph analytics, statistics, an...
more 10:00 - 10:50 Grammercy Greenplum SummitLet me count the ways
presented by Regina Obe
PostgreSQL is one of the easiest databases to get data into and out of. In addition there are tons of free useful data sources to munch on.
We'll cover various ways of loading various kinds of data into PostgreSQL. Data formats will include other postgres data, delimited, fixed width, spreadsheets, other relational databases, and other formats (including spatial formats) you may come across...
more 10:00 - 10:50 Sutton Place Datapresented by Scott Hajek
Data scientists today expect to work with tools that have good abstractions and interfaces. Pure SQL is not the best interface for data science, but the power and scale of SQL-based systems can be beneficial. This talk introduces a modern interface for Postgres and Greenplum that appeals to data scientists.
The importance of good abstractions and interfaces can be seen in the dominance of R...
more 11:00 - 11:20 Grammercy Greenplum Summitand some comparisons with PostgresXL and Citusdata
presented by Pilar de Teodoro
When there is no option to continue scaling up resources, there is a need for scaling out. At the ESA science data center (ESDC) we envisage a growth of the archive data stored in our databases of about 50TB in 2 years. The current technology used, which is vanilla PostgreSQL will not be enough. In order to fulfill the user requirements for the different missions with such big amounts of data,...
more 11:30 - 11:50 Grammercy Greenplum SummitLunch!
presented by PostgresConf Organizers
The Tuesday Lunch, Be there or be square!
12:00 - 13:00 Lower Level Foyerpresented by Sharath Punreddy and Niranjan Sarvi
Real time actionable insights have become vital for business success, and Apache Kafka is the de facto standard for near real time data integration for high data volumes. Greenplum-Kafka connector is high-speed, parallel data transfer utility from Kafka to Greenplum. In this session, we will demonstrate the real time streaming using Greenplum-Kafka connector. The presentation also includes the ...
more 13:00 - 13:20 Grammercy Greenplum SummitApproaches and Configuration
presented by Abbas Butt
#Replication in PostgreSQL - Deep Dive
The objective of this tutorial is to familiarize the participants with replication in PostgreSQL. Specifically this tutorial will focus on what approaches are available and how to use them. For each replication approach that is available in PostgreSQL the participants will learn how to configure a two node cluster using that approach and how to perfo...
more 13:00 - 16:00 Sutton Place Ops and AdministrationFast and efficient DBA skills
presented by Simon Riggs
Update your PostgreSQL DBA knowledge to the latest version with this fast-paced training, showing how some important topics have changed over the last few releases. - Tables and Data - Scripts and testing - Performance - Maintenance Suitable for users of cloud and on-premise PostgreSQL databases. Uses excerpts from the book "PostgreSQL11 Database Admin Cookbook", presented directly by the...
more 13:00 - 16:00 Chelsea Ops and AdministrationPL/pgSQL Edition
presented by Jim Mlodgenski
When developing against a PostgreSQL database, many times you will use SQL from your applications, but sometimes you need more. You might need to run some of your logic on the database server for performance or you may want to just create some server side functions to simplify your application development. There are an infinite number of reasons why you would want to write some code that runs o...
more 13:00 - 16:00 Murray Hill DevelopmentAutomating PostgreSQL failovers, the sane way.
presented by Alexander Kukushkin and Alexey Klyukin
You can get slides at https://github.com/patroni-training/2019/raw/master/slides.pdf
Postgres has a rock-solid single master physical replication, but lacks built-in failover. A number of open-source and proprietary tools aim to fill this niche. Patroni stands apart because it doesn’t try to solve all hard problems by itself. ...
more 13:00 - 16:00 Sugar Hill Ops and Administrationpresented by David Yahalom and Kavitha Gopalakrishnan
Demonstrate how to migrate from Oracle to Cloud SQL for PostgreSQL using easy-to-use migration assessment tool.
13:00 - 13:50 Bowery Google Cloud Summitpresented by Jayaraman Thiagarajan and Venkat Gullapalli
Baker Hughes, a leading oil & gas company, established its Big Data presence by setting up its mission-critical Data Lake on AWS with the consolidation and migration of enterprise data from 45+ data sources, including ERP and non-ERP data flowing into Greenplum Database on AWS amounting to petabyte size storage volume in a highly complex computing environment.
The challenges included the...
more 13:30 - 13:50 Grammercy Greenplum Summitpresented by Steve Hetzel
Tech talk around what it would take to run and manage a self hosted Postgres instance on a Compute VM.
14:00 - 14:50 Bowery Google Cloud Summitpresented by Gregory Chase, Ailun Qin, Eran Shaked, Ronald Smith, and Dmitriy Pavlov
This panel will feature three leaders from organizations that run extensive operations teams that manage Greenplum Database at large scale for production and business critical use cases. We will dig into pressing issues for operations leaders as they look to have stability and order in their deployments.
14:00 - 14:50 Grammercy Greenplum Summitpresented by Oz Basarir
This session will show how Greenplum for Kubernetes brings cloud databases on-premises. We will start with an introduction to the product and the various partners and components that make up the ecosystem of AI, BI, ETL, data preparation and data science tools. Then, we will explain how customers can develop data-driven smart apps using this platform and operationalize AI. Finally, we will prov...
more 15:00 - 15:20 Grammercy Greenplum Summitpresented by Gabi Ferrara
Introductory talk covering Cloud Spanner, a highly scalable globally consistent relational database.
15:00 - 15:20 Bowery Google Cloud SummitAddressing Housing Overflow at Purdue
presented by Ian Pytlarz
With enrollment growing more quickly than our ability to house students, temporary housing was set to grow. In order to reduce the need for this sub-optimal housing, Purdue set about modelling housing contract follow-through to catch students that had no intention to show up on campus and give their housing slots to people in temporary housing, all before any students arrived on campus.
16:00 - 16:20 Grammercy Greenplum Summitpresented by Ben Whitehead
Introductory talk covering Cloud Firestore, a highly scalable globally consistent NoSQL database.
16:00 - 16:20 Bowery Google Cloud Summitpresented by Jon Roberts
Learn about the Pivotal Greenplum in the Cloud Marketplace products as well as the unique, cloud-only benefits.
- Demo deploying
- Use Cases
- Cloud Features
Massively scalable, high throughput NoSQL database
presented by Billy Jacobson
Introductory talk covering a NoSQL approach for large-scale, low-latency, or high throughput database needs.
16:30 - 16:50 Bowery Google Cloud Summitpresented by Shuang Liang
Informal chat with a senior Cloud SQL engineer.
17:00 - 17:20 Bowery Google Cloud SummitPivotal’s Opinionated On-Premise Platform
presented by Derek Comingore
Appliances have been the enterprise standard for retaining and running data warehousing systems for decades. The driving force behind the appliance model’s massive adoption has been simplicity. Enterprise customers have sacrificed both flexibility and openness for simplicity in the appliance era. Pivotal has been busy designing an open and modern reference architecture that encompasses aspects ...
more 17:00 - 17:20 Grammercy Greenplum Summitpresented by Shivram Mani and Francisco Guerrero
Traditional data warehouses are deployed with dedicated on-premise compute and storage. As a result, compute and storage must be scaled together and clusters must be persistently turned on in order to provide data availability at all times. In the cloud, compute and storage can be decoupled by taking advantage of the ability to request on-demand infrastructure. Greenplum in Kubernetes brings th...
more 17:30 - 17:50 Grammercy Greenplum SummitThe What and the How of Making PostgreSQL GDPR Compliant
presented by Karthik Ranganathan
The General Data Protection Regulation, often referred to as GDPR, came into effect on 25 May 2018 across the European Union. This regulation has implications on many global businesses, given the fines imposed if the organization is be found to be non-compliant. Making sure that the app architecture continues to ensure regulatory compliance is an on-going challenge for many businesses that requ...
more 09:00 - 09:50 Bowery Regulated Industrypresented by Oskari Saarenmaa
Aiven PostgreSQL is a fully managed cloud service available in six different public cloud providers' infrastructure in more than 70 regions around the world. This has given us a unique opportunity to benchmark and compare performance of similar configurations in different environments.
We're previously focused benchmarking two different VM sizes with five providers, but now we're also inclu...
more 09:00 - 09:50 Riverside Ballroom Ops and Administrationpresented by Nikolay Samokhvalov
SLIDES: https://docs.google.com/presentation/d/1ZkWboJqzug4q3UYajFdmXOtqyipcuhcQdL4g27GjKeg/edit?usp=sharing
To maintain a Postgres database in a healthy state, it needs periodical comprehensive diagnostics. Being done manually, it might take 1-2 weeks or more, especially if you are seeing...
more 09:00 - 09:50 Grammercy Ops and AdministrationRemote Functions Calls Are Possible
presented by Henrietta Dombrovskaya
At Braviant Holdings we consider foreign data wrappers to be the best thing even introduced in PostgreSQL. We use them virtually everywhere, both for our apps which interact with OLTP databases and for the ETL for our Data Warehouse.
One of the best things about using the FDW is that they allow you to map objects of various types. You can map tables, views, other foreign tables, you can use...
more 09:00 - 09:20 Riverside Suite DevelopmentPostgreSQL as server-side application development platform
presented by Dmitry Dorofeev and Sergey Shestakov
The rise of data-centric computing, NoSQL and newSQL databases with powerful scripting capabilities, popularity of REST API raise a question: is it feasible to serve clients directly from the DB, with REST API server residing inside the database? What would be the balance between data processing, application, and presentation logic for such a scenario on a server side and on a client side? How ...
more 09:30 - 09:50 Riverside Suite Developmentpresented by Amanda Nystrom
People . Postgres . Data Launch!
10:00 - 10:20 New York Ballroom East General SessionPostgres Takes Charge Around the World Behind Strength of Open Source Community
presented by Marc Linster
Presenter: Marc Linster - Senior Vice President of Product Development, EnterpriseDB
Abstract: Perspective on the global economic impact of Postgres development and how the ecosystem investing in core Postgres technology is fueling Postgres longevity and paving the way for its dominance.
The broad expansion of Postgres into enterprises has led to increasingly robust Postgres skills of DBA...
more 10:20 - 10:40 New York Ballroom East General Sessionpresented by Ivan Novick and Jacque Istok
More than 2.5 quintillion bytes of data are created each and every day—and at that rate: Scale Matters. Database workloads at scale are driving some of the most impactful use cases in the world, helping to solve both industry and government’s most interesting problems. Join two of Pivotal’s data thought leaders to hear about how to solve these problems leveraging Postgres at scale, and learn wh...
more 10:50 - 11:10 New York Ballroom East General Sessionpresented by Kevin Jernigan
11:20 - 11:40 New York Ballroom East General Session
presented by Anant Jhingran
Postgres has stood the test of time for around three decades, thanks to the investments in it by developers and by the users. The talk will focus on what are the primary drivers for the next five+ years in the database world, and how the community, by keeping its focus on what got it here, will help steer it in the right directions.
11:40 - 12:00 New York Ballroom East General Sessionpresented by PostgresConf Organizers
Wednesday Lunch!
12:00 - 13:00 Metropolitan Ballroom Eastpresented by Bryan Doyle, John Knapp, and Gregory Chase
Some business verticals have been subject to government regulation for a long time: health care, finance, even the government itself. These days, if you are storing data about people in PostgreSQL, chances are you are subject to at least increasing regulations related to protecting people's privacy. In this panel, we'll talk with PostgreSQL professionals who must comply with regulatory requir...
more 13:00 - 13:50 Bowery Regulated IndustryIntel Non-Volatile Memory Solutions Group
presented by Andrew Ruffin and Frank Ober
In this talk, we'll present information on new storage technologies and how they impact your implementation of PostgreSQL. We'll show results from the PostgreSQL and general open source database community comparing the two major types of NVMe storage SSDs – NAND and Intel Optane technology – found in a growing number of servers running open source databases. We will walk through the various r...
more 13:00 - 13:50 New York Ballroom West Ops and AdministrationInformation Exchange Techniques for Java/PostgreSQL Applications
presented by Alyssa Ritchie and Henrietta Dombrovskaya
Everything shall pass except Object-Relational Impedance Mismatch. Despite many years of research and development in this area, applications and databases still struggle with communication issues. An incompatibility of data models between object-oriented application code and a database engine causes multiple performance problems and triggers over-complicated application design.
In this talk,...
more 13:00 - 13:50 New York Ballroom East DevelopmentHow database proxies optimize query read/write performance with no code changes
presented by Erik Brandsberg and Zack Odom
Learn the various techniques on how Heimdall's Database Proxy improved throughput and performance:
1) Batch DML operations: intelligently process singleton operations as micro-batches. 2) Intelligently route diverse workloads (i.e. OLTP, OLAP) to Postgres, utilizing Postgres's latest features, such as materialized views for analytic purposes. 3) Auto-caches into GemFire or other cache eng...
more 13:00 - 13:50 Riverside Suite Datapresented by Sunil Kamath and Rachel Agyemang
In this session we will begin with a quick overview of the Azure Database for PostgreSQL service and how it achieves durability, high availability and delivers elastic scaling without having to require replica instances. Then we will do a deep dive into the exciting new features we recently released, including Intelligent Performance, Advanced Threat Protection and new Postgres extensions we en...
more 13:00 - 13:50 Riverside Ballroom DataUnderstanding spatial data concepts
presented by Ryan Lambert
The PostGIS extension turns Postgres into the world's most advanced open source spatial database. Not all DBA's are experienced with spatial data or spatial analysis and that can cause misunderstandings and performance problems. The core problem is often a problem with how the spatial data is loaded and provided to analysts. This presentation provides practical examples of how to understand a...
more 13:00 - 13:50 Grammercy DataFrom an RDS PostgreSQL Persepective
presented by Dennis Tighe
RDS PostgreSQL has the features and functionality to help keep your data safe and meet many of the common controls established by organizations. RDS PostgreSQL currently offers a range of features to help you protect your databases. In this talk you will learn some best practices for keeping your RDS PostgreSQL database secure.
14:00 - 14:50 Bowery Regulated IndustryRetrieving Hierarchy Data From Relational Tables
presented by Ben Lis
Recursive queries enable PostgreSQL users to retrieve hierarchical data from relational tables. Attendees learn:
- why we need recursive queries
- how they work
- how to write them
The talk assumes knowledge of SQL through outer joins and a general understanding of programming concepts.
The schema and queries used in the talk are available for download at https://github.com/benjli...
more 14:00 - 14:50 Grammercy Datapresented by Sridhar Paladugu
(Greenplum Technical Session for PostgresConf Wednesday
Successful enterprise AI applications in 2018 are ecosystems of machine learning solutions that tightly integrate a feedback loop triggering automated updates to the underlying algorithms - creating closed loop machine learning systems.
In order to efficiently build and scale these systems enterprises need reliable, highly perform...
more 14:00 - 14:50 Riverside Suite DataA talk on languages, datatypes, contrib and maybe even faces.
presented by Jacques Combrink
Introduction on what makes PostgreSQL such a great DB. How extendable it is in terms of languages, contrib packages and the different data types. From there I will show how you can find, hash and effectively search on faces using a the correct types, python and some prebuilt deeplearning.
This is a soft talk only touching on all the extentions and cute features in postgres, closing out wi...
more 14:00 - 14:20 Riverside Ballroom Use Cases-
presented by Shashank Jain and Dinesh Kumar
SAP Cloud Platform (SCP) is an open platform-as-a-service (PaaS) product that provides core services, for building and extending cloud applications on multiple cloud IAAS. SCP supports AWS, OpenStack, Azure and GCP.
One of the core services provided by SCP is PostgreSQL as a Service (PostgreSQL-as-a-Service). Each PostgreSQL-as-a-Service instance(cluster) consists of 5 VMs (PG-Mast...
more
14:00
-
14:50
New York Ballroom East
Development
Make monitoring great again!
presented by Peter Celentano
Regardless of where they're hosted, knowing what your database is doing at all times is a necessary component in a solid infrastructure. Grafana and TimescaldDB form the foundation of a monitoring stack which can be built and scaled to monitor nearly any production environment. While solutions such as Zabbix and Nagios allow for similar solutions, they are often cumbersome to navigate, need con...
more 14:00 - 14:50 New York Ballroom West Ops and AdministrationHow we got our queries down from 16sec to 20ms
presented by Jonathan Salama and Karina Ruzinov
Have you ever tried to debug an API endpoint and realized it wasn’t your nested for loops taking a long time, but in reality it was a seemingly innocuous query. We did too. This led us down a path of learning the ins and outs of index storage, functionality, and benefits.
An index can both be very powerful and very dangerous, at the same time, unless you understand the right place and time ...
more 14:30 - 14:50 Riverside Ballroom Use Casespresented by Hans-Jürgen Schönig
Are you suffering from bad performance? That might not be necessary. A simple and easy approach can help to sort out many performance problems fast and efficiently. By using a systematic approach many nasty issues can be solved and users can be kept happy. During this talk you will learn to systemmatically approach a system and make it fast again.
15:00 - 15:50 Grammercy Datapresented by Denish Patel
If you are a Developer and/or a DBA, you must have come across this question: "why isn’t my query using an index?" This session explores the factors that influence the Postgres optimizer’s approach to this question. Learn about the concepts of cardinality and histograms and hear how these affect the optimizer's calculations. See a demonstration of how these concepts work in practice using real...
more 15:00 - 15:50 New York Ballroom East Developmentpresented by Aditya Padhye and Ambarish Joshi
In this demo, we’ll see how to use Airflow to build and manage Data Science workflows in Greenplum. We'll also take a look at how we can quickly iterate on, and continuously improve, data science models that have been deployed.
Technologies used: - Greenplum - Airflow
15:00 - 15:50 Riverside Suite DataLet's build postgres
presented by Abhijit Paithankar
Crave.IO is a startup in stealth-mode. We have developed a containerized parallel build-accelerator for large-scale software projects, including Linux and Postgres, reducing hours-long builds from hours to minutes. The Crave SaaS platform currently uses Postgres to manage all user-data, orchestrate container configuration in the cloud, user performance metrics etc.
In this presentation, we w...
more 15:00 - 15:20 Riverside Ballroom Use CasesIntroduction to the tools and best practices from 3 years in production environment
presented by Rafał Hawrylak and Michał Gutkowski
Every production system needs monitoring. Things get interesting when there are hundreds of PostgreSQL instances with automated cloud deployments. There are multiple solutions for collecting PostgreSQL metrics. Grafana is a popular choice for visualization and analytics of time series data. Prometheus is a convenient tool to collect and store metrics of multiple PostgreSQL instances. Alertman...
more 15:00 - 15:50 New York Ballroom West Ops and Administrationpresented by Pratik Verma and Prasad Mujumdar
"Coming together is a beginning. Keeping together is progress. Working together is success." --Henry Ford
A data lake is where data generated by different applications, commercial and homegrown) of multiple business units is stored in one place. It’s operated upon by multiple compute platforms like Postgres/Greenplum and Hadoop. Given that all the business data comingle, the access control ...
more 15:00 - 15:50 Bowery Regulated IndustryUpgrading can't be that hard, right?
presented by Cory Stephenson
With the advent of PGLogical, upgrading a PostgreSQL cluster with (relatively) zero down-time is somewhat easy. But PGLogical requires PostgreSQL 9.4 or later. With 9.3, it’s not even close to easy. Add in over a hundred clusters, with a combination of read-write, write-only, sharded clusters, and combinations of the three, and you have a whole cluster of problems. Join me as I describe how we ...
more 15:30 - 15:50 Riverside Ballroom Use Casespresented by PostgresConf Organizers
Where all the connections happen!
16:00 - 19:00 Metropolitan Ballroom East.
presented by Baron Schwartz
Does EXPLAIN confuse you? You're not alone. This will help you understand what EXPLAIN output means, how to interpret it, what's good/bad in a given EXPLAIN plan, and how to change queries, indexes, or schema to get better query execution plans.
09:00 - 09:50 Chelsea Developmentpresented by Nikolay Samokhvalov
SLIDES: https://docs.google.com/presentation/d/1jinPA8Y5K_H8iKngG-Utpg1d8mOOVD7J3KQhX53z4XY/
What is the optimal value of shared_buffers for your database and workload? 8 GiB? Or 16? Is it worth using 70% of RAM if you run Postgres 11 on a server with 512 GiB of memory?
Of course, you might check dozen...
more 09:00 - 09:50 Bowery Use CasesWhy Migrate, Where to Start, How to Proceed
presented by Prashant Agarwal
Presenter: Prashant Agarwal - Senior Development Manager, EnterpriseDB
Learn how you can break free from on-premises Oracle and quickly migrate to Postgres in the cloud. The session will cover step by step instructions for executing a migration from Oracle to Postgres:
Why migrate, challenges for database migration and more
Where to start and how to pick migration targets
A li...
presented by Ivan Novick
This is a private event.
09:00 - 17:00 Riverside Suitepresented by Dan McClary
Learn all about Postgres@Google
10:00 - 10:10 New York Ballroom East General Sessionpresented by Sunil Kamath
Come join us to learn about how Microsoft is innovating on behalf of customers to deliver delightful Postgres experience and enterprise capabilities building on community edition. You will hear updates on some of our latest work, new announcements and also get a glimpse of our future roadmap.
10:10 - 10:20 New York Ballroom East General SessionMultiple data models, multiple use cases
presented by Simon Riggs
Keynote for PgConf.US, as a result of being a Sponsor: Simon describes the ways in which PostgreSQL is capable of many use cases, catering for many different kinds of data model and data handling.
10:20 - 10:30 New York Ballroom East General Sessionpresented by PostgresConf Organizers
Coffee Break
10:30 - 10:50 Metropolitan Ballroom EastUsing Row Level Security to move access control from the application to the database.
presented by Bennie Swart
In this talk we examine the use cases of Row Level Security (RLS) which was introduced in PostgreSQL 9.5.
In traditional applications, it is very common to find a lot of access control code residing inside the application layer, where the application connects to the database with a single database user, but multiplexing multiple application users. The database user typically has more rights ...
more 11:00 - 11:50 Grammercy Use CasesI/O sub-system cost/performance design considerations.
presented by Jeffrey Zampieron
The rise of cloud hosting has led to the conversion of cap-ex to op-ex in a number of scenarios. The promise of click to scale and easy availability of high-performance IaaS components are clearly attractive. In this work we investigate the performance of a variety of cloud-hosted IaaS configurations compared against some off-cloud VM and bare-metal options when running pgbench workloads. Pract...
more 11:00 - 11:50 Chelsea Ops and AdministrationData-science at Equinox, a luxury fitness club
presented by Will Liu, elliott cordo, and Paul Singman
Quick overview of Redshift and its Postgres heritage, similarities like SQL interface and differences like storage and scalability.
How we use Redshift as a data warehouse solution at Equinox to analyze data and provide insight.
Some strengths and short-comings of Redshift when it comes to data consumption and how we use Postgres to compensate, specifically with the dblink extension, to b...
more 11:00 - 11:50 Bowery Datapresented by PostgresConf Organizers
Lunch
12:00 - 13:00 Metropolitan Ballroom EastIntroduction to the core architecture of FoundationDB
presented by Ben Collins
The FoundationDB Key Value store exposes an API with a strong resemblance to a B-Tree storage engine. This API is similar to internal APIs used in most RDBMSs, the API has concepts such as global versions and allows for MVCC operations. The design goal this this API was to mimic such a storage engine API such that it could be a state storage layer that would allow for the development of full da...
more 13:00 - 13:50 Grammercy DevelopmentA look at the current state of PostgreSQL backup tools
presented by Hannu Valtonen
As data amounts have grown, the requirements for backup software have changed greatly over time.
As your cloud VM is not your father's personal computer neither are these modern PostgreSQL backup daemons the same as your forefathers backup tools.
In this presentation I'll go through some of the current crop of popular cloud enabled PostgreSQL backup daemons to see how they've evolved to f...
more 13:00 - 13:50 New York Ballroom East Ops and AdministrationAn introduction to YugaByte DB
presented by Karthik Ranganathan and Mihnea Iancu
Building applications on PostgreSQL that require automatic data sharding and replication, ability to scale out with fault tolerance and geographic data distribution is hard. In this talk, we will look at how to achieve this using YugaByte DB, a distributed database that is a storage engine for PostgreSQL while simultaneously being fully wire-compatible with NoSQL APIs (Apache Cassandra and Redi...
more 13:00 - 13:50 Bowery DevelopmentEmpowering Organizations to Analyze the Past, Understand the Present, and Predict the Future
presented by Mike Freedman
Time-series databases are one of the fastest growing segments of the database market, spreading across industries and use cases. Common requirements include ingesting high volumes of structured data; answering complex, performant queries for both recent and historical time intervals; and performing specialized time-centric analysis and data management.
Today, many developers working with tim...
more 13:00 - 13:50 New York Ballroom West Developmentpresented by Eric Sheetz
Creating applications that are both resilient and scalable is an essential part of any application architecture.In this session, you'll learn how to use Google Cloud SQL for PostgreSQL to build scalable and resilient application architectures using patterns and practices that apply broadly to any web application.
13:00 - 13:50 Chelsea Use CasesA look at most important postgres and OS parameters to improve performance
presented by Payal Singh
Looking at the list of kernel and postgresql.conf parameters, tuning your PostgreSQL database may seem like a daunting task. While each parameter has a role to play, in most scenarios just a handful of these end up impacting performance the most. This talk aims to be a quick practical guide to tuning your PostgreSQL cluster to get the best possible performance with the least effort, with specia...
more 14:00 - 14:50 New York Ballroom East Ops and Administrationpresented by Andrew Dunstan
Stored procedures have arrived with PostgreSQL version 11. Come see what has been implemented, and how procedures differ from functions. We will discuss some use cases for procedures such as migrating from other database management systems and manipulating data.
14:00 - 14:20 Bowery DevelopmentHow Timescale’s open source extension increases the readability, performance, and efficiency of “gap-filling” queries.
presented by Matvey Arye
One property of real-time data is that it often arrives at irregular intervals. An example of this is minute-by-minute averages of temperature sensors where data arrives from a sensor that is intermittently offline.
When querying such data, we often want results even for minutes that do not have data data reported from the sensor. In such cases, we often want data for such “gaps” reported ...
more 14:00 - 14:50 Grammercy DataPostgreSQL FDW
presented by Ibrar Ahmed
PostgreSQL provides a way to communicate with external data sources. This could be another PostgreSQL instance or any other database. The other database might be a relational database such as MySQL or Oracle; or any NoSQL database such as MongoDB or Hadoop. To achieve this, PostgreSQL implements ISO Standard call SQL-MED in the form of Foreign Data Wrappers (FDW). This presentation will explain...
more 14:00 - 14:20 Chelsea Use Casespresented by Grant McAlister
In this session we will dive deep into HOT (Heap Only Tuple) update optimization. Utilizing this optimization can result in improved writes rates, less index bloat and reduced vacuum effort but to enable PostgreSQL to use this optimization may require changing your application design and database settings. We will examine how the number of indexes, frequency of updates, fillfactor and vacuum se...
more 14:00 - 14:50 New York Ballroom West Ops and AdministrationControlling Kubernetes from psql
presented by Liz Frost
My day job is as a Kubernetes project contributor, and Postgres Foreign Data Wrappers have always held a special place in my heart. Combining them is only natural.
Kubernetes uses etcd as its data store, but it maps very well onto Postgres. All APIs follow a very uniform schema that makes programmatically mapping objects to tables easy. The user experience I settled on makes it easy to pres...
more 14:30 - 14:50 Bowery Developmentpresented by Tony Perez
Packet is an innovative datacenter startup that offers unique architectures for webscale and edge-computing applications. Packet uses Postgres extensively for the tracking of system and network assets, customer usage metrics and billing. We will share our experiences using Postgres at scale in a mission-critical function, our journey with Postgres over the years as we expanded globally and buil...
more 14:30 - 14:50 Chelsea Use Casespresented by PostgresConf Organizers
Coffee Break
14:50 - 15:10 Metropolitan Ballroom Eastpresented by dharshan rangegowda
Thinking about moving your PostgreSQL deployments to the public cloud, or already have and wondering if your setup is properly optimized? Discover the top 10 questions to ask yourself before deploying PostgreSQL on public clouds, and how to fortify your production environments with high availability, disaster recovery, and advanced monitoring to stay on top of your deployment performance. We'll...
more 15:20 - 16:10 Grammercy Ops and AdministrationWork Smarter and Become More Agile by Delivering Code Faster
presented by Jeffrey Surretsky
Shorten your database development cycles without compromising quality and performance • Integrate Database Development into your Continuous Integration workflow • Work smarter and become more agile by delivering code 40% faster
15:20 - 16:10 Bowery DevelopmentPostgreSQL partitioning and sharding strategies
presented by Jobin Augustine
Title : Divide and Conquer data:- Advanced Methods for partitioning and sharding data - Latest developments
- Abstract
- Historically PostgreSQL was not known for features available for horizontal scalability in handling large data sets. Many PostgreSQL based systems were designed to handle specific types of business cases. But the PostgreSQL as its core was a bit lagging beh...
Creating dashboards and alerts using PostgreSQL tables without deploying a single line of code
presented by Preetam Jinka
Observability is a core value in our engineering organization, and while we have plenty of manual instrumentation in our application code, we've found that creating visualizations directly from PostgreSQL data gives us a whole new approach to improving system observability. Using PostgreSQL as a Grafana datasource allows you to directly visualize the source of truth instead of depending on appl...
more 15:20 - 15:40 New York Ballroom West Ops and AdministrationAn overview of the PostgreSQL in China, the Community and China PostgreSQL Association organizations
presented by Grant Zhou
I will give the Community an overview of the PostgreSQL ecosystems in China, and introduce the COPU, China PostgreSQL Association organizations to the community for the connections to China. Also, I will present the PostgreSQL technical practices and trends in China. China PostgreSQL Association welcomes PostgreSQL contributors, corporations to extend their influences to China and the Associati...
more 15:20 - 15:40 Chelsea Use Casespresented by Mara Lemagie
No matter what your current level of interacting with databases, everyone should have at the ready ways of explaining how databases work, whether it is to your grandmother or non-database oriented coworkers. This talk will be equally useful to new PG users who want to build good habits for writing their queries and seasoned veterans who are recruiting and training new data professionals. We wil...
more 15:50 - 16:10 New York Ballroom West Ops and AdministrationHow to stop worrying and love LIDAR software
presented by Karl Pietrzak
Recent growth in the IoT industry has led to increased demands of relational databases such as PostgreSQL. In this presentation, we will use a combination of TimescaleDB and pgpointcloud to:
- create a database solution supports very high ingest rates
- while also providing compression across multiple dimensions
W...
more 15:50 - 16:10 Chelsea Use CasesA deep dive into wait events on PostgreSQL
presented by Jeremy Schneider
PostgreSQL 9.6 introduced wait events and PostgreSQL 10 progressed them, but what are they? What do they mean? How do I find them and how do I make them go away? Wait events are one of the most significant advancements in observability for PostgreSQL databases; their usefulness is unparalleled in troubleshooting performance. This talk will go into all that and more as we explore the world of ...
more 16:20 - 17:10 New York Ballroom West Ops and AdministrationEverything else you need for a successful deployment
presented by Andrew Dunstan
PostgreSQL is not deployed in a vacuum, it relies on many other elements. This talk focuses on all the things that aren't PostgreSQL that you need to make your use of it easy and successful. We'll talk about tools for administration, for monitoring, for backups, for high availability, for debugging, for disaster recovery. We'll also talk about the elements of a management plan for a successful ...
more 16:20 - 17:10 New York Ballroom East Ops and Administrationpresented by Nicolas Gakrelidz
This session will give Data Professionals (Analytics Leaders, Data Engineers, Data Scientists, Data Analysts) a roadmap for navigating the path to Enterprise AI and driving data science at scale using Postgres\Greenplum with Dataiku.
Digital transformation are the operative words in strategic plans in enterprises across all industries. Organizations must use data to continuously develop m...
more 16:20 - 16:40 Grammercy Datause cases from Near real time upgrades to Change Data Capture
presented by Tom Kincaid
2ndQuadrant the contributors of logical encoding and logical replication to Postgres, has developed an open source extension called pglogical. This talk will cover the evolution of logical replication through out the history of Postgres and it will also cover the primary use cases of logical replication such as near real time upgrades and change data capture. It will then dive into specific f...
more 16:20 - 17:10 Chelsea Use Cases-
presented by Ankita Rani, Nishtha Srivastava, Vinayak Jadhav, Shashank Jain, and Dinesh Kumar
Multi-Tenant-PostgreSQL-as-a-Service in SAP Cloud PLatform
SAP Cloud Platform is an open platform-as-a-service (PaaS) product that provides core platform and backing services, for building and extending cloud applications on multiple cloud infrastructure providers.
One of the core services provided by SCP is Multi-Tenant PostgreSQL as a Service (MT-PostgreSQL-as-a-Service). MT-Pos...
more
16:20
-
17:10
Bowery
Development
presented by Jeffrey Surretsky
This presentation showcases the following products in a Postgres migration context:
· Toad Data Modeler · Toad Data Point · Benchmark Factory · Foglight for Cross Platform Databases · SharePlex · Toad Edge for Postgres
09:00 - 09:50 New York Ballroom West DataDev Ops tools to keep your database healthy
presented by Bernhardt Garlipp
Dev-Ops. Imagine the following "Developer commits new feature. Feature passes Automate test. Feature gets Deployed. Oh no ... everything is slow."
This talk is aimed to provide methods and tools on how to identify slow and problematic queries on a database cluster and how to address and solve these problems. The second part of the talk will be aimed at lessons learned in our company and how ...
more 09:00 - 09:50 Riverside Ballroom Ops and AdministrationManaging PostgreSQL with a cloud oriented approach
presented by Julian Fischer
Managing hundreds of highly available PostgreSQL instances and their life cycle is not a dream. Using BOSH, the release engineering, deployment and life cycle management software for small and large-scale cloud software, can help with failure recovery and software updates with zero-to-minimal downtime, but even then there is a long way to go until full automation.
Attending this talk partici...
more 09:00 - 09:50 New York Ballroom East Ops and AdministrationA collaborative data science platform
presented by Venkata Jagannath
Join this demonstration of TIBCO Data Science, a collaborative data science platform for data preparation and analytics at scale with PostgreSQL and Greenplum, and a Gartner Data Science Leader. The San Francisco Office of Financial Justice aimed to discover insights on the impact of parking citations on low income drivers in San Francisco. The team discovered that older cars, which are assume...
more 09:00 - 09:50 Riverside Suite DataP is for Persistence
presented by Justin Reock
Learn how to use Postgres as a backing persistence adapter for the ActiveMQ messaging platform, as well as an integration endpoint for the powerful Apache Camel integration framework. Not only will you learn about JDBC, but you'll also get a solid introduction to these two mature and powerful integration platforms.
10:00 - 10:50 Riverside Ballroom DataSome Use cases as to when you shouldn't use JSONB in postgres.
presented by Anson Abraham
Use case of using JSONB in PG, when it works and when it doesn't work, and you'll need to work another doc store and use FDW to access from PG.
10:00 - 10:50 New York Ballroom West Use Casespresented by Preetam Jinka
ShiftLeft is a security startup in the Bay Area that's working on a combined static analysis and runtime protection security solution. The code analysis process starts with either JVM byte code or source code and is transformed into a novel graph representation called a Code Property Graph (CPG). This CPG, which is a combination of syntax, control flow, and data flow information, is then inspec...
more 10:00 - 10:50 Riverside Suite Use CasesPostgres 11 edition
presented by Jim Nasby
The use of Multi-Version Concurrency Control (MVCC) is perhaps one of the most powerful features PostgreSQL has to offer, but it can be a source of confusion for new and experienced users alike. In this talk we will provide an in-depth walkthrough of why Postgres needs to vacuum and what vacuum does.
Topics:
- MVCC details
- HOT overview
- Identifying tuples to be vacuumed/frozen
- Vis...
presented by PostgresConf Organizers
Panel discussing opportunities and challenges to increasing diversity and inclusion in technical communities.
- Panelist: Mai Ton — VP of Humans at White Ops
- Panelist: Alexandra Paredes — Head of Engineering at Code Climate
- Panelist: Rachana Kumar -- Director of Engineering at Etsy
- Panelist: Carlos Rymer -- Engineering Manager at New York Times
- Moderator: Mustafa Khan — Head o...
presented by PostgresConf Organizers
11:00 - 14:00 Metropolitan Ballroom East
presented by PostgresConf Organizers
12:00 - 13:00 Metropolitan Ballroom East
Building a modern PostgreSQL developer experience
presented by Dr. Rony Chatterjee and Anup Kamath
Azure Data Studio is a cross-platform database tool for data professionals using the Microsoft family of on-premises and cloud data platforms on Windows, MacOS, and Linux. We have introduced support for PostgreSQL database in Azure Data Studio to support your database development in modern editor experience with Intellisense, code snippets, source control integration, and an integrated terminal...
more 13:00 - 13:50 Riverside Ballroom DataPostgres transactions for fun and profit
presented by Alexey Klyukin
Did you ever wonder why vacuum cannot be run in a transaction? Can you make foreign data storages magically support ACID transactions by wrapping your statements against foreign tables in Postgres with BEGIN ... COMMIT? If you ever asked those questions or want to learn what is possible with Postgres transactions, then this talk is for you! Postgres can durably store your data, but it is also m...
more 13:00 - 13:50 New York Ballroom West DevelopmentThis talk is a quick introduction to GraphQL subscriptions, the different approaches of implementing change capture on Postgres and streaming to web clients and finally an overview of the implementation in the open-source Hasura GraphQL engine
presented by Tanmai Gopal
GraphQL is a new API format for webservers to talk to clients that is rapidly gaining popularity, as an alternative to REST-ish designs. The GraphQL spec allows for "subscriptions" will allows a webserver to push changed results or events to a web-client. Building a scalable & reliable GraphQL server is a challenge.
At Hasura we've built a service that automatically generates a GraphQL API ...
more 13:00 - 13:20 Riverside Suite Use Casespresented by Robert Treat
Postgres 10 added a new form of replication; logical replication; into the wide array of replication and failover options available within the Postgres ecosphere. In this talk, we'll cover:
• a (very) brief history of replication in Postgres
• what is logical replication?
• differences between Postgres 10 and 11
• how does it differ from other replication options?
• good use cases f...
more
13:00
-
13:50
New York Ballroom East
Ops and Administration
An Introduction to Common Table Expressions
presented by Hein Kruger
This is a Quick introduction to the use of CTE's . I will touch on the benefits but mostly its just some practical uses of the CTE structure as well as the use of recursive CTE's.
- What is a Common Table Expression
- Basic Syntax
- Basic Example
- Comparing a CTE with a Nested Query
- Selecting from a CTE
- Using an Update in a CTE
- Using a Delete in a CTE
- Using a Inse...
Build Data Pipelines without a single line of code!
presented by Viktor Gamov
Have you ever thought that you needed to be a programmer to do stream processing and build streaming data pipelines? Think again!
Companies new and old are all recognizing the importance of a low-latency, scalable, fault-tolerant data backbone, in the form of the Apache Kafka® streaming platform.
With Kafka, developers can integrate multiple sources and systems, which enables low latenc...
more 14:00 - 14:50 New York Ballroom East DataTaking advantage of multithreading and SIMD in Rust
presented by Jack O'Connor
Slides: https://jacko.io/bao_presentation/presentation.html
Bao is a cryptographic tree hash, based on BLAKE2 and implemented in Rust. Tree hashes are more flexible than conventional hash functions, and Bao can verify small pieces of a file independently rather than just the whole thing at once. Tree hashes also make it possible to hash different parts of a file in parallel, which makes them...
more 14:00 - 14:50 New York Ballroom West DevelopmentPostgreSQL Performance Tuning
presented by Avinash Vallarapu
*Abstract : *
In the proposal, I would like to share how we have optimized a few database parameters that did outburst the performance by several times. It was a PostgreSQL Server of size 325 GB and more than 1500 TPS. After our tuning, we were able to get the database process upto 4900 TPS without compromising on ACID or availability.
** The talk includes : **
- The environm...
presented by Lloyd Albin
A shadow table is an table that is read and written by a trigger function and contains data similar to (in the same format as) its primary table, which is the table it's "shadowing" along with a few extra fields so that you can time travel the shadow table and know who did what and when. In this presentation we will cover traditional Shadow Tables which works with all current version of Postgre...
more 14:00 - 14:50 Riverside Ballroom Datapresented by PostgresConf Organizers
The conference wrap-up and talking about PostgresConf 2020
15:00 - 15:20 New York Ballroom East