In this episode we explore the new features of Seqera's Data Studios and Data Explorer, with Phil Ewels, Rob Newman and Rob Syme from Seqera.
Discover how to use these tools for troubleshooting Nextflow pipelines, tertiary analysis and Nextflow development. We discuss the pain points that led to the creation of Data Studios and how it lets scientists work with data and complex workflows interactively and collaboratively, without having to move large datasets around.
Rob Syme wows us with another fantastic practical demonstration: setting up a Data Studio, then using it to write and test a Nextflow pipeline in VSCode running on the cloud, including running the Nextflow CLI with task submission to AWS Batch.
We cover features like session persistence to save work states, and upcoming custom container support for your own specialized applications.
Learn how these tools can enhance your computational biology projects and make seamless cloud integration a reality.
00:00 Channels Podcast 43: Data Studios
00:26 Introductions
01:54 Data Studios
04:51 Move the compute to the data
06:13 Real-time collaboration
06:47 Data Explorer
09:41 Access to public data
10:45 Data Explorer demo
13:56 Data Studios setup
20:17 Session persistence
22:52 Data Studios RStudio demo
28:24 Nextflow development in Data Studios
36:17 Future development
37:01 Custom containers
40:01 Boston Summit demo
44:01 Lifetime management
47:14 Wrap up