202 avsnitt • Längd: 25 min • Månadsvis
Arrested DevOps is the podcast that helps you achieve understanding, develop good practices, and operate your team and organization for maximum DevOps awesomeness.
The podcast Arrested DevOps is created by Matt Stratton, Trevor Hess, Jessica Kerr, and Bridget Kromhout. The podcast and the artwork on this page are embedded on this page using the public podcast feed (RSS).
Openness plays a significant role in propelling DevOps and organizational processes forward. This is not to imply that everything must be open, but the default should be openness unless a valid reason indicates otherwise.
Andrew Zigler, developer advocate at Mattermost, and Matty from Arrested DevOps recently shared insights on this subject. They discussed creating impactful developer advocates, managing community writing programs, and dealing with the challenges of open source communities.
Andrew emphasizes that the loudest and most contributory voices in open source projects are usually the paid internal staff. However, he champions setting up pathways in the community to validate the experience of all contributors and reward them with anything from thought leadership, platforms, or even swag. The key is to influence individuals at all levels of engagement and ensure that they feel they own part of what they are contributing.
One of the challenges he identified is over-influencing which often stems from the fact that the paid staff are the ones driving the open source project vehicle. This imbalance usually drowns out the voices of other contributors, particularly those who may not have the luxury of dedicating as much time and energy to the project as the paid staff.
Andrew suggests a solution: the company creating more developer advocates through the multiplier effect. This means ensuring that everyone across the board understands the importance of the open-source community and empowers them to contribute. The more developers contribute, the larger and more diversified the community becomes, leading to better outcomes and solutions.
Matty highlights how vital leadership is in these initiatives. By allocating resources, prioritizing open source community engagement, and maintaining a strategic focus, leaders can do much to foster a healthy open-source community. Successful leaders understand that engagement levels differ, so they create opportunities for different levels of contributors to partake and contribute to the community.
To ensure the project remains harmonious and aligned with company goals, the leadership should give equal weight to both staff and contributors’ voices. In the end, everyone involved in the project is part of the community.
The conversation took an interesting turn when they started discussing engineering blogs, a tricky subject for many organizations. Matty points out that these blogs have the tendency to publish sporadically, often dominated by lengthy droughts of content or a sudden overflow of posts.
Such inconsistency happens when the contributors, mostly engineers, write when they can spare the time. Balancing this dynamic is crucial, and one suggested solution is to involve people whose primary job is creating content. They can collaborate with subject matter experts to create consistent, relevant content.
Operating under a default open environment for your projects does not mean that everything has to be open. Nevertheless, transparency and openness should be the norm unless necessary otherwise. By dealing with the occasional echo chamber and understanding that contributions will always ebb and flow, the community will thrive and keep moving forward.
In line with the open source spirit, scaling advocacy is crucial in DevOps. It involves not only the individuals whose title is developer advocate but everyone within the company. By creating more advocates and amplifying community efforts, the DevOps movement continues seamlessly.
DevOps World is back for 2023, and you won’t want to miss out on this one-of-a-kind event! This year’s program is packed with exclusive insights, immersive workshops, and unparalleled networking opportunities taking place across multiple cities in the US, UK, and Asia. Elevate your DevOps game and register using the following links: NYC area, Chicago, Silicon Valley, Singapore, and London.
DevOps World is back for 2023, and you won’t want to miss out on this one-of-a-kind event! This year’s program is packed with exclusive insights, immersive workshops, and unparalleled networking opportunities taking place across multiple cities in the US, UK, and Asia. Elevate your DevOps game and register using the following links: NYC area, Chicago, Silicon Valley, Singapore, and London.
Whether you’re working for a startup or a corporate giant like Visa, Target, or JP Morgan Chase, your personal brand helps to define your professional identity. This brand is not just about showcasing your GitHub contributions, but about telling your unique story. It’s about how you solved a particular problem or contributed to a project, not just about the technologies you used.
Building your personal brand starts with self-reflection. Here are three questions to ask yourself:
Who are you professionally? Identify your top three technical specialties, professional specialties, and team contributions.
Who are you personally? Identify three interests and hobbies, your most important beliefs and values, and aspects of your social, family, or community life that you want to connect with people over.
How do you connect with people? Identify shared interests, the advice or information you’re seeking, and what you want to know about other people.
Answering these questions gives you a wealth of material to draw from when telling your story and helps you identify your strengths and areas of expertise.
When you’re creating your brand content, remember the old adage: show, don’t tell. Instead of proclaiming yourself a “thought leader,” demonstrate your expertise through your work. Share your accomplishments in a way that highlights how your work benefited others, not just yourself. This approach makes your story more engaging and relatable.
Being open about what you don’t know can be a powerful part of your personal brand. It shows that you’re a lifelong learner, open to new ideas and willing to grow. Plus, asking for help or resources can lead to valuable connections and insights.
Sharing your knowledge and helping others is a powerful way to build your personal brand in tech. It demonstrates your expertise and your willingness to support your peers. This approach is not about trading favors but about creating a positive ripple effect in your community.
In conclusion, building a personal brand in tech is about more than just showcasing your skills. It’s about telling your story, connecting with others, and contributing to your community. By being authentic, open, and generous, you can create a personal brand that truly stands out.
Jess and Roni talk about what continous feedback: where it came from, what it looks like in the context of a dev proces, and the benefits it can bring to engineers and developers. They also discuss Roni’s observability project, Digma.ai… and his other passion, complicated board games.
Breaking Down Gates with Tim Banks (previous episode with Tim)
You know how so many devops folks are like “lol let me tell you about this time I accidentally took down production”? I would *love* to get to that level of blamelessness and shame-free acceptance with phishing, security misconfigurations, and other such mishaps.
— Kat Sweet (@TheSweetKat) October 25, 2021
If you're in infosec, what are you 5 top priorities for your team right now?
— matty stratton (@mattstratton) October 26, 2021
Melody: melody.dev
Approval testing tools mentioned
Bridget chats with Kent Rancourt about Brigade, a tool for running scriptable, automated tasks (in Kubernetes).
(episode art by Ashton Rodenhiser)
In this episode, Jess talks with guest Cat Swetel about her career, writings, and thoughts on DevOps.
Jess: “Cat is known internationally for her Penguin Power Stance and for standing on the edge of now!” Cat: “I like working with things on the edge of now. So that’s either things that shouldn’t exist anymore or shouldn’t exist yet.”
Jess and Cat talk about projects that fit Cat’s definition of the edge of now.
Cat: “I do believe that DevOps is inherently feminist, because it puts the emphasis on that maintenance and reproductive work rather than producing working software.”
Jess brings up Eric Evans’ concept of software as gardening.
The panel discusses reproductive vs productive work. Cat talks about what she sees in healthy DevOps teams.
Jess and Cat talk about the challenges of working with legacy systems.
The panel discusses metacommunication and Gregory Bateson.
Cat: “In that situation, everyone is operating as specified but it’s still not working! So that’s when it becomes necessary to not have so much of that transactional interaction…It has to be something more generative.”
Cat and Jess talk about the ethics of “care vs fair”. Cat dives into how feminist theory has impacted her thinking on DevOps and systems.
Cat: “If we valued caring for the systems and caring for each other, rather than thinking fair or unfair…Let’s check, are we caring for these systems, are we being mindful? Then rad, let’s keep going.”
Joe’s Babylon 5 talk at DevOpsDays Madison 2016
Senior + Principal folks: Is your resume contained to one page or do you opt for longer to encompass all your experience?
— emily freeman (@editingemily) November 10, 2020
I’m in a debate and I need your input.
Shout-out to Matt’s friend Marcelo for the link for Chili John’s (and for taking Matt there so many years ago)
Home recording setup for the win for @devopsdaysChi pic.twitter.com/N2qWFPDOsx
— Laura Santamaria @[email protected] (@nimbinatus) August 16, 2020
Image credit: Tea and Anarchy, modified from Anarchist Revolt
Font: 1403 Vintage Mono Pro by Jeff Kellem
Aaron discusses the three big projects he’s working on. The first is cost efficiency, or, as Netflix thinks of it, “cost compression.”
Jessica: “Oh, so you have to achieve cost compression without telling engineers not to spend money.”
Aaron: “If you don’t believe you can predict very well, then what you should do instead is get really good at reacting.”
The panel discusses the difference between autonomy and agency.
The panel talks about the utility of data dashboards.
Aaron: “The dashboards are more like cost debugging ultilities.”
Aaron talks about his second big project, “a unified strategy for access isolation.”
Jessica: “You want the cells to have access to the other cells in the same muscle tissue, but if they need a nerve ending they have to say so!”
Aaron explains the third big project he’s working on, “Netflix’s regional growth and high availability story.”
Aaron talks about how Netflix produces original content all over the world, and the coordination required to serve all the computation needs of the various projects in different places.
Ecology, the Ascendent Perspective
Aaron’s blog post, The Sufficiently Smart Engineer
OSM logo art credit: @flynnduism
Helm logo art credit: @flynnduism
Banner image: @bridgetkromhout
Kubernetes Rust Kubelet on GitHub
Introducing Krustlet, the WebAssembly Kubelet by Matt Fisher
Kubernetes and waSCC by Brian Ketelsen
waSCC capability that will make your ops folks cry - GitHub link to evil project (don’t install this)
Kubernetes: A Rusty Friendship - Taylor Thomas
WebAssembly meets Kubernetes with Krustlet by Ralph Squillace
Guests Dan Bergh Johnsson, Daniel Deogun, and Daniel Sawano join host Jessica Kerr to discuss their book Secure by Design.
Daniel: “There’s a lot of good designs which come naturally to us as programmers but which has the interesting side effect that they also prevent security-related bugs.”
The panel discusses domain primitives as an example of coding practices that naturally provide security through good design.
Dan Bergh: “It’s a good starting point to understand that using domain-driven design not only makes your code more expressive, solves more domain problems. Even though these designs were not crafted to address security to start with, they’ve also had that as a side effect.”
Jessica: “I love that what you’re recommending in this part is to think harder about what you do want in the system, express that in the code, and suddenly a bunch of things that you don’t want in the system just aren’t.”
The panel talks about the ways in which testing contributes to secure design.
Daniel Sawano: “It tends to be so much easier and more robust if you start defining your own domain types.”
The panel discusses the benefits of immutability.
Dan Berg: “It’s possible to…configure and mutate them until they are kind of safe-ish.” Jessica: “Kind of safe-ish?” Dan Berg: “Well, we are on a DevOps podcast.”
The panel talks about the security implications of logging practices.
Daniel Deogan: “One thing that’s very important is that if you log input directly into your logs, it becomes an attack surface for second-order injection attacks.”
Dan Bergh: “It’s a perfect launchpad for doing a really, really hard attack inside your system.”
Daniel Deogan: “The common mistake that many developers do is that they more or less dump inputs blindly.”
Jessica: “We have this illusion that logging is simple, but it isn’t.”
The panel discusses the chapter on cloud thinking.
Dan Bergh: “In a way, we’re instructing the system to become more intelligent.”
The book is available online in its entirety.
Check out A Minute on the Mic for bite-sized videos from experts on various topics!
The panel discusses the origins of the book Team Topologies. The project started with a blog post.
Matthew: “Back in 2013, I actually wrote a blog post in my personal blog. I actually wrote it in a rage.”
In 2015, Manuel joined the team to help expand on the ideas from that blog post and create Devops Toplogies.
Manuel: “What the hell are you calling a DevOps team? DevOps is not about creating a new team called DevOps.”
The panel discusses the impact of DevOps Topologies and some of the companies that have used it, including Netflix and Conde Nast.
Matthew and Manuel explain how the project has evolved over time as DevOps Topologies was being deployed in the real world.
Matthew: “It’s not just a set of patterns or templates. We wanted to provide an organizational capability for detecting when things have changed and have gone wrong.”
The panel discusses Conway’s Law and its implications for DevOps.
Manuel: “Teams are the means of delivering value.”
The panel discusses the importance of flow in both living systems and organizations.
Manuel: “It’s a more experiment driven approach where we have this goal or this need we need to meet and then allowing the teams to find the right solution.”
Jessica: “In modern systems, the flow is of changes to the flow of the product. It’s a very different level of work.”
The panel discusses the need for different team configurations that are constantly evolving.
Matthew: “It seems quite important to understand different kinds of dynamics in the organization at different times.”
The panel discusses the three team interaction modes laid out in the book: collaboration, as-a-service, and facilitation.
The panel discusses the four team topologies in the book: value stream aligned teams, enabling teams, platform teams, and complicated subsystem teams.
Jessica: “The limitation of a team is cognitive load. It’s not resources, it’s not pizza.”
Manuel: “Although pizza is very appealing.”
Manuel discusses Dunbar’s Number and how that concept can put useful constraints on teams.
Episode images by Jessica Kerr. Show notes by Tyler Wilson.
Rule #1 of open-source: no is temporary, yes is forever.
— Solomon Hykes / @[email protected] (@solomonstre) March 30, 2016
Switched from devrel to product. Helped with Helm 3 launch!!! Traveled less. Dragged Joe to a personal trainer because if he’s going maybe I will too.
Lots of speaking (spoke at 24 conferences in 2019). Moved back to Chicago. Went to lots of devopsdays. Got even more interested in resilience engineering and realized I don’t know shit.
More travel! A lot of customer work, focus on delivering solutions, now moving officially into product. I also got really into shuffleboard and DND
Consulting!
Bridget chats with Kelsey Hightower about Kubernetes and the future.
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
Bridget chats with Ian Coldwater at the devops Minneapolis meetup about their KubeCon North America 2019 keynote.
Video from KubeCon: Hello From the Other Side: Dispatches From a Kubernetes Attacker
Art credit: Sarah Becan - original tweet, threadless store, website
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
Bridget chats with the authors of Kubernetes Best Practices: Brendan Burns, Eddie Villalba, Dave Strebel, and Lachlan Evenson. (Kindle version available now!)
Dave:
Brendan:
Eddie:
Lachie:
Bridget:
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
Bridget chats with guests Peter Shannon, Jocelyn Harper, and Tim Gross in front of a live studio audience at devopsdays Philadelphia 2019.
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
In this episode, host Jessica Kerr is joined by guests Gene Connolly and Joan Freed to discuss DevOpsiCon and their DevOps transformation at Meltwater.
The panel discusses how DevOps has transformed the working environment at Meltwater. When Joan started at Meltwater, engineering and operations were kept entirely separate.
Joan: “There was this huge wall because engineering knew how things were built and operations knew the production systems…but there wasn’t any cross-pollination there.”
The lack of communication between the teams caused friction and slowed down product delivery. The process could take as long as six months.
Joan: “We’re a Software as a Service company so we always want to build products that are sticky and keep our customers around.”
Joan discusses how Meltwater began to move to a more agile business model. Getting engineers and operations people in the same physical location to work this out was important. Creating truly cross-functional teams couldn’t happen over Skype alone. DevOpsiCon was born to facilitate this transition.
Buy-in wasn’t immediate throughout the firm. Teams were updating legacy systems throughout the company and some were more focused on work than enablement.
Joan: “The first one was a little bit sneaky. The next one was in Manchester and that one was a little less sneaky.”
The panel discusses the importance of investment in enablement. This allows the feature teams to focus on the features they were actually building instead of all the underlying infrastructure.
Joan: “Having them build everything they need is not realistic.”
The fact that Meltwater has offices all over the globe makes any wholesale shift of company culture difficult. The panel discusses the need for face-to-face interaction to facilitate change.
Gene: “There are these events like DevOpsiCon where we solve the problem by bringing a broad set of people together.” DevOpsiCon has grown beyond just developers. Product, UX, support, and even sales have joined the event to help build relationships and give insight.
Jessica: “That is very DevOps in the sense that DevOps says that you need to take these concerns that cannot be separated…and you can’t put those responsibilities on different teams and make them fight.”
The panel discusses the value of DevOpsiCon as an unconference. Attendees collectively set the agenda on each day of the event.
Gene: “That format has been so much fun and created so much value.”
Gene: “The event becomes the conference you didn’t know you needed at the start. It adapts collectively to what the organization needs at that moment.”
Gene discusses how allowing teams the freedom to experiment can yield results that a top-down structure could not provide.
The panel talks about the structure of mission, framework and support teams.
Joan: “By keeping things loosely coupled, it gives teams a lot more freedom to make the technology choices that are best for them.”
The panel discusses the ownership challenges of moving from the datacenter to the cloud specifically and large enterprise changes more generally. The investment in enablement is critical.
Joan: “There’s very much a not built here mentality that can take place.”
Gene: “Software can benefit from being owned by many people.”
The panel talks about overhead costs to both software and human changes.
Gene talks about the challenges of moving to a full mission responsibility model.
Gene: “You need to find efficiencies more effectively than you’ve ever had to find efficiencies before.”
Gene: “Oh wait, you’re looking to me for an answer?”
The panel discusses the results of the ongoing DevOps transformation at Meltwater. The six month process for production changes has transitioned to a constant stream. Beyond the productivity, it has also been a quality of life improvement for the employees.
Gene: “We’ve got a Slack channel for all the production changes and it is constant activity in it all week long of changes going out.”
Joan: “We’re now getting hundreds of features out every six months to our customers.”
The panel discusses the ways in which transformation is, or should be, a constant process. The change goes on.
Episode art by Evelyn Kerr.
Matty Stratton talks with guests Ken Mugrage and Sasha Rosenbaum about their new event DeliveryConf.
Bridget chats with Devi Moodley, Daniel Maher, Adrian Moisey, and Cobus Bernard at devopsdays Cape Town 2019.
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
In this episode, hosts Jessica Kerr and Matty Stratton are joined by guests Dr. Nicole Forsgren and Mr. Jez Humble, two of the authors of the 2019 Accelerate State of DevOps report.
The panel discusses the history and purpose of the report. This is the sixth year it has been produced. How did the report start and what questions is it seeking to answer?
Nicole: “Once upon a time, kids, making software was sad. Making software used to light people on fire!”
Nicole: “So why do we even do this DevOps thing? It’s because we want to make that software process easier and better.”
Jez: “These approaches have worked even in highly regulated environments. They’ve worked everywhere.”
Nicole explains that capabilities and practices are more important than tools.
Nicole: “There’s no such thing as DevOps in a box.”
The group discusses the limitations of systems data vs survey data, and the importance of collecting both. Survey data is low resolution but high signal and vice versa.
Jez: “Over-precision is something that’s a real problem in our industry.”
Nicole: [“How to Measure Anything” - Douglas W. Hubbard] ( https://www.amazon.com/How-Measure-Anything-Intangibles-Business-ebook/dp/B00INUYS2U )
The panel discusses the importance of starting from hypotheses instead of looking for any spurious correlations that exist in highly related systems.
The panel discusses some of this year’s findings. The “retail apocalypse” over the last decade has had some surprising effects. Regulation has less of an impact than many industries assume.
Jez: “We do see high performers in large companies who are highly regulated.”
Matt: “People will sit there and say well, I have over 5000 employees and I can’t DevOps so now I have an excuse, and that’s not the case.”
Change management processes are one of the key areas of difference between large and small enterprises. Jez discusses how to make that process more lightweight even in a large organization.
Jez: “Risk is also about upside risk. If you can’t move fast at delivering software, that’s a risk to your business.”
The panel discusses the importance of keeping a holistic view even in a large enterprise with specialized roles.
Nicole: “You go from change management theater to strategic change management for a massive organization. It’s scary but it is also dope!”
The report has found year after year that a slower change management process can paradoxically result in more instability, not less. The panel discusses the reasons this experiment was not successful and how enterprises can implement more nimble processes going forward.
The panel talks about what productivity actually means. The report uses an interesting definition: “Productivity is the ability to get complex, time-consuming tasks completed with minimal distractions and interruptions.”
Nicole: “You may be just closing the tickets that are meaningless but are easy to close.”
Research shows that real productivity reduces burnout and improves work-life balance. The group discusses how to separate that real productivity from gamification and busywork. Jez talks about scaling up and how centers of excellence may not be as useful as previously thought. Nicole points out that the report has given a lot of starting points for people to begin making changes in their organization.
Matty and Trevor chat with guests Jessie Frazelle, Veronica Hanus, and Jeff Smith in front of a live studio audience at devopsdays Chicago 2019.
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
Bridget and Matty chat with guests Liz Fong-Jones, Alice Goldfuss, and RJ Williams, in front of a live studio audience at devopsdays Minneapolis 2019.
Liz Fong-Jones is a new organizer of devopsdays New York City. Alice Goldfuss is the MC for devopsdays Portland. RJ Williams is an organizer of devopsdays Raleigh.
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
Velocity Berlin Nov 4-7 2019 - discount code “ADO2019” gives 20% off for Gold, Silver, and Bronze passes, and Best Price ends August 2.
For any devopsdays, try the discount code ADO2019!
Trevor chats with Steven Murawski of Microsoft about Azure DevOps, Windows Terminal and all the cool things from Microsoft Build 2019.
On being a principal engineer - blog post by Silvia
Velocity San Jose June 10-13 2019 - discount code “ADO2019” gives 20% off for Gold, Silver, and Bronze passes.
For any devopsdays, try the discount code ADO2019!
CFP for devopsdays Chicago: open until May 3rd
Silvia: the Beyoncé movie came out on Netflix: Homecoming
Jessica: Do something outside! It’s spring!
Matty: Stocksy - for affordable stock imagery that benefits the artists and Super Team Deluxe for great pins and stuff
Velocity San Jose June 10-13 2019 - discount code “ADO2019” gives 20% off for Gold, Silver, and Bronze passes.
For any devopsdays, try the discount code ADO2019!
Velocity San Jose June 10-13 2019 - discount code “ADO2019” gives 20% off for Gold, Silver, and Bronze passes.
For any devopsdays, try the discount code ADO2019!
Previous episode: Punk Rock DevOps
Image credit: aesthetictech.net
Velocity San Jose June 10-13 2019 - discount code “ADO2019” gives 20% off for Gold, Silver, and Bronze passes.
For any devopsdays, try the discount code ADO2019!
This is the unofficial pilot for their new podcast: weird trick mafia
Jessie’s job-shadowing blog posts: Government. Medicine. Capitalism? (Wednesday, February 27, 2019) and Trust and Integrity (Friday, March 1, 2019)
Andrew mentions a book: Team of Teams
Jessie wrote a blog post about Intel SGX.
Image credit: oldpatterns
Jessie: QCon London, dotGo Paris
Andrew: devopsdays Atlanta
Velocity San Jose June 10-13 2019 - discount code “ADO2019” gives 20% off for Gold, Silver, and Bronze passes.
For any devopsdays, try the discount code ADO2019!
According to…
Greater Than Code podcast
Matty (with special guest host Jessica DeVita) is joined by Ana Medina, Dan Barker, Ben Clayton, and Monica Hart at DevOpsDays Kansas City 2018.
Trevor is joined by Jessica Deen at Microsoft Ignite 2018, and have a quick catch up on the event and the state of the DevOps world.
Trevor is joined by Edward Thomson at Microsoft Ignite 2018, and have a quick catch up on the event and the state of the DevOps world.
Trevor is joined by Steven Murawski at Microsoft Ignite 2018, and have a quick catch up on the event and the state of the DevOps world.
Trevor and Jason are joined by Donovan Brown at Microsoft Ignite 2018, and have a quick catch up on the event and the state of the DevOps world.
Matty is joined by VM (aka Vicky) Brasseur, Vice President of the Open Source Initiative, for a chat about contributing to open source software.
Matty has a chat with Christine Spang of Nylas about company culture and on-call techniques and war stories.
Matty chats with Nicole Forsgren, Wes Novack, a shadowy figure known only as Chris from Qualtics, Jason Vance, and Matthew Barlocker in front of a live studio audience at devopsdays Salt Lake City 2018.
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
Bridget and Matty chat about organizing conferences with guests Jam Leomi, Debbie Gillespie, and Christian Herro, in front of a live studio audience at devopsdays Minneapolis 2018.
Jam Leomi is a past organizer of devopsdays Silicon Valley, and they just moved to Minneapolis. Christian Herro is a founding organizer of devopsdays Madison, and Debbie Gillespie is a new organizer of devopsdays Minneapolis.
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
Guests J. Paul Reed and Mary Thengvall talk about resiliency, safety, and a great new conference - REdeploy!
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
On this episode of Arrested DevOps, Matty and Bridget discuss conference speaking with participants at devopsdays Amsterdam 2018.
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
On this episode of Arrested DevOps, Trevor is joined by Chloe Condon, Nathen Harvey, and Nell Shamrell-Harrington. Everyone on this episode has been a part of the theater community at some point in their lives. We talk about our individual journeys from theater into tech, the lessons we learned on the way, and how we leverage those lessons every day.
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
Bridget curated the distributed systems track at GOTO Chicago 2018. In the last timeslot of the day, she gathered all the speakers from the track to discuss their topics.
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
What is a developer advocate / tech evangelist? #devrel #devx #advocate. credit @mikegstowe pic.twitter.com/RvYysfEits
— 𝔹𝕖𝕣𝕟𝕕 𝕍𝕖𝕣𝕤𝕥 (@BerndVerst) September 5, 2017
Matt will be at the devops meetup in MSP on March 20 and then home for a bit. In April he’ll be at DrupalCon in Nashville, Devopsdays Des Moines, and GOTO Chicago. See mattstratton.com/speaking for more.
Lots of devopsdays: https://www.devopsdays.org/speaking/
ADO2018
for 20% off lots of devopsdays, 10% off ChefConf, 5% off GopherConJAYGORDON
for 25% offBridget discusses Devops Weekly with Gareth Rushgrove.
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
Definition of a hot take - “a piece of commentary, typically produced quickly in response to a recent event, whose primary purpose is to attract attention.”
Charity Majors (Honeycomb), Jill Jubinski (Fastly), and Eric Sigler (PagerDuty) sit down with Matt to have some fun with a few great DevOps “hot takes” and talk about what everyone is doing wrong.
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
Bridget and Matt discuss the past and future of tech with Andrew Clay Shafer.
Guernica by Picasso
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
What is it like to change careers and get into tech later in life? Annie Hedgpeth and Megan Bohl tell their stories.
This episode also features special guest Sonia Gupta!
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
Bridget discusses all things Go with Brian Ketelsen and Erik St. Martin.
Artwork credit: Ashley McNamara’s Gophers art repo on github
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
Matt, Trevor, Bridget, and Joe discuss 2017.
Past year-in-review eps:
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
Discount codes: ADO2018 for 20% off lots of devopsdays, 10% off ChefConf
Open source projects have a lot of benefits, as we all know. But sometimes it can be a real challenge to take tools and projects developed internally and open-source them, especially from a traditional enterprise. Guest Aaron Rinehart shares his journey and story with open sourcing his internal security chaos engineering tool, Chaoslingr.
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
Discount codes: ADO2018 for 20% off lots of devopsdays, 10% off ChefConf
Matt refers to an episode of “Coupling” on the BBC - the episode was Series 1, Episode 2, “Size Matters” - here’s the quote:
Patrick: Oh, don’t be so piecey.
Howard: Typical lefty puritan..
Sally: Typical what? Come the revolution-.
Patrick: What revolution!? You guys are in power! We’re the revolution now!.
Sally: No. No, it can’t be right…
Patrick: You’re the evil empire..
Sally: No!.
Howard: Yes! Like Star Wars, and Patrick and me are the Rebel Alliance..
Sally: No! You’re not the goodies, we’re the goodies! We’re lefties!! We’re always goodies!.
Patrick: puts glass on his mouth and makes a Darth Vader voice No, Sally. You’re the establishment.
Image credit
By inkknife_2000 [CC BY-SA 2.0 (https://creativecommons.org/licenses/by-sa/2.0)], via Wikimedia Commons
Bridget sat down for a fireside chat with Alice Goldfuss (GitHub). No actual fires were harmed in the making of this episode.
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
Use code “ADO2017” for a discount on many devopsdays.
Devopsdays Madison 2017 was the second year for this event. With great speakers, workshops, open spaces, and sponsors, this event brought the community together to learn and share.
Bridget and Matt sat down with a speaker (Emily Freeman) and a couple of organizers (Joshua Zimmerman & Christian Herro) to talk about the wider themes of organizational change when you may not call all the shots in your org.
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
Use code “ADO2017” for a discount on many devopsdays.
Ines Sombra (Fastly) and James Turnbull (Empatico) are chairs of the Velocity conference series, which is celebrating its 10th year in 2017. They joined Bridget to talk about the events next month in New York and London, and share tips for making the most of your conference as well as submitting talks to future conferences.
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
Use code “ADO2017” for 20% off at Velocity New York Oct 1-4.
Use code “ADO2017” for a discount on many devopsdays.
Food Fight Show and Arrested DevOps joined forces to host our first live devops call in show! We featured Dr. Nicole Forsgren to answer your DevOps questions about measuring effectiveness, ROI of DevOps initiatives, and more!
Recorded May 8, 2017.
The unofficial title of this episode: “Ya sure, Nicole?”
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
Use code “ADO2017” for 20% off at Velocity New York Oct 1-4.
Use code “ADO2017” for a discount on many devopsdays.
Bridget chats with Jez Humble (DORA) about continuous… everything!
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
Use code “ADO2017” for 20% off at Velocity New York Oct 1-4.
Bridget will be at Uptime in Pittsburgh this week.
Use code “ADO2017” for a discount on many devopsdays.
Bridget and Matt chat about enterprise transformation and open source with guests Bryan Liles, Jessie Frazelle, and Andrew Clay Shafer, in front of a live studio audience at devopsdays Minneapolis 2017.
Sys Admins, DevOps, SRE. Oh My! - Bryan Liles’ opening keynote
Security In A Containerized World - Jessie Frazelle’s closing keynote
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
Use code “ADO2017” for a discount on many devopsdays.
Bridget and Matt chat with Daphne Chong (Amazon) and Kenny Bastani (Pivotal).
Daphne’s GOTO Chicago talk: Video Transcoding at the ABC with Microservices
Kenny’s GOTO Chicago talk: In the Eventual Consistency of Succeeding at Microservices
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
Bridget and Matt chat with Jérôme Petazzoni (Docker), Mark Heckler (Pivotal), and Jennifer Heckler (Edward Jones).
Mark & Jennifer’s GOTO Chicago talk: Clouds & Containers: Hit the High Points and Give it to Me Straight, What’s the Difference & Why Should I Care?
Jérôme’s GOTO Chicago workshop: Container deployment, scaling, and orchestration with Docker Swarm
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
Bridget and Matt chat with Nicole Johnson (Chef), Matt Curry (Allstate), and Anthony Lee (Allstate).
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
Bridget chats with devopsdays Toronto local organizer Amy Mansell & speakers Roderick Randolph, Arthur Maltson, and Aaron Aldrich.
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
Bridget and Matt chat with Jeff Smith (Centro) and Mark Imbriaco (Pivotal).
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
Bridget and Matt chat with Nicole Forsgren (DORA) and Tim Gross (Joyent).
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
Bridget and Matt chat with Andrew Clay Shafer (Pivotal) and Bryan Cantrill (Joyent).
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
Bridget and Matt chat about devops in a large enterprise with Bryan Liles (Capital One).
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
Bridget chats about startups with Charity Majors (Honeycomb) and Nicole Forsgren (DORA).
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
DevOps Cafe w/ Jeffery Snover - Linux is docs based, Windows is API based
Bridget and Joe discuss their experiences at devopsdays Cuba and share audio from the closing session.
A few videos of devopsdays Cuba have made it to youtube. They can be found [here.] (https://www.youtube.com/watch?v=bTMwHcjpMf4&list=PLobspijdw3822IEFotAHz_vWrc0nTqwxn)
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
Matt, Trevor, and Bridget chat (at length) about podcasts, podcast recording, and podcast recording software. Oh, and the highlights of 2016 if they get around to it. (Don’t miss the supercut of all 2016’s cold opens, which was edited by Joe, even though Matt takes credit for it!)
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
Bridget and special guest host Matt Ray of Software Defined Talk chat with Matthew Jones, Lindsay Holmwood, Mick Pollard, and Katie McLaughlin at devopsdays Sydney 2016.
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
Distributed systems, service discovery, load balancing: Service Discovery at Stripe
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
For any devopsdays, try the code ADO2016! It should get you 20% off.
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
For any devopsdays, try the code ADO2016! It should get you 20% off.
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
For any devopsdays, try the code ADO2016! It should get you 20% off.
Don’t forget to check out the book itself! The Art of Monitoring.
Back in the day, James also wrote a book called Pro Nagios 2.0.
Three stages of monitoring maturity:
“You will eventually get to CPU, memory, and disk, but a lot later after you start with the things you should really care about” - James
“Too much monitoring is binary - this thing either works or it doesn’t” - James
Also mentioned:
Where we’ll be for the upcoming fortnight
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
For any devopsdays, try the code ADO2016! It should get you 20% off. Also now works on O’Reilly Security conference.
For more DevOps awesomeness, check out the Chef Community Summit, October 26th and 27th in Seattle, WA. This Open Space event provides a great opportunity to connect with the DevOps Community and Chef Engineers over two days of engaging sessions and hallway discussions. Bring your ideas, passion and excitement for Chef and DevOps to this highly interactive event. Go to summit.chef.io to register for this awesome event and use the code ARRESTEDDEVOPS to get 10% off your ticket!
Bridget sits down for a classic fireside chat with Bryan Cantrill (Joyent), ranging from containers to social justice to lawn care.
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
For any devopsdays, try the code ADO2016! It should get you 20% off. Also now works on O’Reilly Security and Velocity conferences.
Matt and Bridget chat with Michael Hedgpeth (NCR) and Doug Ireton (1Strategy) about organizations adopting open source.
Image credit: https://flic.kr/p/3V99c8
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
For any devopsdays, try the code ADO2016! It should get you 20% off. Also now works on O’Reilly Security and Velocity conferences.
Tim Gross (Joyent) and Adam Jacob (Chef) independently started solving the problem of how to put applications in control of their own configuration. They discuss Habitat and ContainerPilot, to the edification of Matt and Bridget.
github.com/joyent/containerpilot and example applications at github.com/autopilotpattern
The Art of Closing - open source maintainer advice from Jessie Frazelle
Badass by Kathy Sierra
DevOpsDictionary.com - I’m having a bit too much fun contributing to it.
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
For any devopsdays, try the code ADO2016! It should get you 20% off.
How do large enterprises transform the way they do IT? What does it mean for every company to become a software company? Our panel of experts at devopsdays Minneapolis 2016 has worked in some of the largest orgs out there and has seen a lot of transformation first-hand.
Awesome videos from ChefConf:
Other delightful stuff:
Notes from 2016 DevOps Days DC session on DR for your career: http://e.devopsdaysdc.org/p/openspace-spades-A
Severance agreement timelines: Employee Checklist: What to Do When Your Employer Offers You a Severance Agreement … Esp: “If your employer has not given you a reasonable amount of time, or rushes your decision, this is a red flag…If you are being rushed, ask for more time [in writing]”
How to bounce back: Let folks know - Mike Fiedler and Annie Hedgpeth had good examples on Twitter recently.
#cheffriends, I’m interested in working w/@Chef or #security in #DevOps #DevOpsSec remote or DFW. Would love your help!
— Annie Hedgpeth (@anniehedgie) June 30, 2016
Ok recruiters. I'm open to hearing from you now. Make it good, let's not waste each other's time.
— Mike Fiedler, Code Gardener (@mikefiedler) June 6, 2016
Use all your networks
Want to dig deeper into some of the things we discussed today? Check out these past episodes:
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
For any devopsdays, try the code ADO2016! It should get you 20% off.
We have t-shirts now! And mugs! They are available at store.arresteddevops.com! Only unisex for now, but more styles coming! Buy one today. Or not. We’re not the boss of you.
If you have an upcoming conference you would like to see promoted on ADO, you can fill out the handy form at arresteddevops.com/conf
For any devopsdays, try the code ADO2016! It should get you 20% off.
For any devopsdays, try the code ADO2016! It should get you 20% off.
We have t-shirts now! And mugs! They are available at store.arresteddevops.com! Only unisex for now, but more styles coming! Buy one today. Or not. We’re not the boss of you.
Ben:
Jessie:
Bridget:
Effective DevOps by Ryn Daniels & Jennifer Davis
Lara Hogan - Day-of-talk countdown
Bridget - tl;dr: Your Talk is Accepted
Ryn Daniels - On a Conference Speaking Routine
Draconian - Sovran (Oct 2015)
Walls of Jericho - No One Can Save You From Yourself (March 2016)
For any devopsdays, try the code ADO2016! It should get you 20% off.
We have t-shirts now! And mugs! They are available at store.arresteddevops.com! Only unisex for now, but more styles coming! Buy one today. Or not. We’re not the boss of you.
Empathy is CI For The Soul by David Shackelford
This was a special co-production with The Goat Farm. Don’t forget to subscribe to their Enterprise DevOps podcast on iTunes or Stitcher!
Panel Discussion from ChefConf 2015: Have Your Bets on Open Paid Off? Moderated by Cade Metz, Wired Magazine Panelists: Mark Russinovich (Microsoft), Jeff Arcuri (Gap), Phil Dibowitz (Facebook)
Errata: “We’re no longer an airline. We’re a software company with wings.” Matt’s quote at 17:02 was from Alaska Airlines, not United Airlines.
Software Defined Talk (they need a more memorable URL)
Various links referenced in the episode!
Matt and Eric are pretty sure the only reason that Red Hat acquired Ansible was to get Robyn Bergeron back.
Matt spends the entire episode claiming that Nathen was famous for being on ADO11, when in fact it was ADO14.
The Reddit post referenced in the episode:
Having a difficult time wrapping my head around test driven infrastructure
ChatOps is used by many teams and companies as the main communication tool for day to day chat, and their most important activities. In fact, ChatOps may be taking the place of email in the workplace for internal communication for tech teams as it helps communication during DevOps activities like deploys, code pushes, etc. This episode discusses best practices (if there are any) of ChatOps and how to make sure you are getting the most from your team communication tools.
The panel comes to the conclusion that important decisions should lean away from ChatOps, and into a more formal, permanent form of communication. “Important things will ‘re-bubble’ again,” but the chatroom is not the place if a team consensus is needed, especially if the team is remote.
Matt, Trevor, and Bridget catch up with Kyle Kingsbury about his research on failure in distributed systems, lighting rigs controlled by code, consent and representation, and more.
Call Me Maybe: an exploration of failure in distributed systems using the Jepsen tool.
Monitorama 2015 talk on Riemann
Bridget announces that she’s joined Pivotal; Matt claims this is to diversify the podcast so it’s no longer Chef employee, Chef partner, Chef customer.
Lots of open CFPS on devopsdays.org
Chef Community Summit is Oct 14 and 15 in Seattle. Matt & Trevor will be there.
Matt & Trevor: at DevOpsDays Chicago August 25 & 26!
Bridget: at VMWorld the first week in September.
Charity Majors (Parse/Facebook) is an “Accidental Computerer.” She was the first infrastructure hire at Parse, which was acquired by Facebook in April 2013. Charity handles all of the backend operations and DBA work, and manages a team of 7 engineers.
Patrick McDonnell is a web operations manager at Etsy. He made the transition from individual contributor to management a few years ago.
Mike Rembetsy (MCR) is the VP of Technical Operations at Etsy. He was one of the first ops people at Etsy when he joined in 2008, and has helped grow the team to over 50 engineers since then.
Charity points out that your first question should be whether or not you actually need an ops team. She says, “There are a lot of places out there that think they need traditional operations engineers, when all they really need is someone to really care about their infrastructure… You should have genuinely hard operations problems before you even start looking to hire engineers.”
Once you do start down the path of building an ops team, MCR notes that you have to maintain it. You’ll need to grow the team, grow the individuals, grow the culture, which, if your company is in a startup phase, can be distracting to the overall goals.
“The glue that holds a team together is how well people interact with one another, how well they respect each other,” says MCR. “That’s what you build good ops teams on, and frankly, that’s what you build any good team on.”
As a growing company, we suggest you follow this process:
Trevor agrees that cultural concerns are absolutely an issue, and asks for suggestions on how to interview for that.
MCR explains that at Etsy, they split their interview questions into technical questions and cultural questions about how the interviewees handled particular situations, as well as other skills.
Charity notes that good ops engineers are good at learning things, but some people freeze up when they’re put on the spot. She suggests providing interviewees with at least 50% of the questions beforehand, so they have a chance to prepare the preliminary information before interacting with her in a formal interview. “You want people to bring you the self that you’re going to be working with on a day-to-day basis,” she says, “not the self that is freaking out and wondering how they’re being perceived.”
Transitioning into the topic of management, Charity brings up the point that if, as a manager, you’re still responsible for key pieces of the infrastructure, you’re holding your team back in their technical development.
One of the jobs of a manager, MCR says, is to help motivate your team, and then step aside and let people get things done. In doing this, you let them succeed, as well as fail, and grow as a result of those experiences. He references Daniel Pink’s book, Drive, and the three pieces of science that motivate human beings: autonomy, mastery, and purpose.
“A manager’s role is a facilitator,” says Patrick. “Everyone should be doing what they think they need to do. It’s my job to remove obstacles that come in their way and make sure I can smooth things out if need be, but really, it’s to encourage and allow people to reach their full potential.”
Etsy has two separate career paths: one for individual contributors and another for management, which allows for the honing of particular skills specific to the end goal.
Charity agrees wholeheartedly with this approach, and says emphatically, “Management isn’t a promotion. It’s a career change.”
In addition, manager and leader is not synonymous. Some people are much better leaders than they are managers, and vice versa. It is also possible to be a leader while continuing as an individual contributor. In fact, Patrick points out that in some circumstances, you might actually lose influence when moving into a management role if you’re already a leader amongst your peers.
Bridget asks the panel, “What’s your best advice for someone in the position of building an ops team?” Patrick: “Focus on hiring good people. Hire people that you like. Hire people that you trust. Hire people that, maybe they need to do a little more research, maybe spend a little more time on StackExchange than the next person, but you know that they’re going to get the job done in the way that you need to get the job done.”
Charity: “Build your networks. Go to meetups, talk to people. Don’t just talk to the popular kids. Reach out to diverse communities and diverse crowds, and go meet people who are doing cool and exciting things, who are slightly off the beaten path. The more people you know, the better you’re going to be at hiring.”
MCR: “Make sure that you address conflict. Make sure you create a safe place for the people you do hire, to have open, honest conversations with one another. There’s nothing more toxic to a team than people chatting behind other people’s backs.
MCR: – Open Source Utility for ELK – 2012 Velocity Talk from Dr. Richard Cook: How Complex Systems Fail Charity: – Guided Meditation for Adults: “Breathe in strength, breathe out bullshit.”
Patrick: – Dr. Christina Maslov’s talk at Velocity: Burnout in Tech – 5-minute burnout self-assessment
Bridget: – DevOpsDays Minneapolis talks
Trevor: – Batman Arkham Knight – GitKraken
DevOpsDays at #alltheplaces
Andrew Clay Shafer (@littleidea).
Coming to you live from DevOpsDays Minneapolis, Matt and Bridget sit down with Andrew Clay Shafer in front of a live audience to talk about the growth of DevOps, explain some commonly heard but not always understood terms, and more (after a brief detour on why episode numbers on podcasts are obnoxious, and why this episode is titled “Eating Sushi with Andrew Clay Shafer”).
Don’t know who Andrew is? He suggests you Google him, but then goes on to give a little bit of his background: he’s been involved in software development and technology for almost 20 years. After rooming with Luke Kanies, founder of Puppet Labs in college, Andrew got interested in operations and system administration.
O’Reilly’s Velocity Conference was also influential in Andrew’s growth, and Andrew reiminisces about his presentation on Agile Infrastructure in 2009. John Willis speaks up from the audience, and strongly suggests going to look at the slides.
Matt takes a moment to explain how Arrested DevOps started: “I started listening to John and Damon on DevOps Cafe and understood about 5% of what they were talking about… There are these things that we, as part of this community, tend to know, and what we try to do with this show is break it down for the people who don’t have the tenacity or stubbornness that I did.”
Along that vein, Matt asks Andrew to expand upon the “wall of confusion” idea that was referenced in his 2009 talk, and has become a commonly-used (but not always understood) term in DevOps lingo.
“It’s a jargony way to talk about the different incentives that exist between developers and operations,” says Andrew. “There’s a transition that happened as software became service-oriented, versus shipped on CDs, where the servers now become this critical part of the value chain, and if you deemphasize the system administration and operation of those servers, then you don’t actually have software. In the middle of these two worlds, where in one, systems administrators were for keeping the printers and the mail server up, to where they’re a critical part of the value chain in the new world, there are broken IT practices that don’t make sense when you’re trying to manage a service. It means recognizing that the best way to optimize a system isn’t to just throw random stuff onto production servers, and then make it ops problem, but to recognize that the infrastructure itself has become an application, and that you can manage these things as an application.”
Andrew points out that as much as he enjoys the attention (and who wouldn’t?), he was simply in the right place at the right time, and the right people listened to him. He connected dots to take advantage of tools and practices that were already used to manage software process, and bring that into the infrastructure and operations works.
Bridget asks, “You mentioned Agile, and you mentioned Scrum. I’ve heard you say that Scrum is a disease. Can you give us your thoughts on where that sort of stuff is going?”
“My personal opinion is that Scrum’s impact on software development is net negative,” Andrew says. “I think it’s particularly bad when people try to adopt it in operations. It’s really susceptible to problems when you have any interrupt-driven work whatsoever.” He suggests pursuing kanban and chatops, making work explicit and visible – tools that allow both you and your management to understand the full context and value of the situation.
The conversation transitions into talking about how to make actual changes to your operations and infrastructure teams, rather than always jumping through hoops to make the necessary changes to keep the pages up or the apps running smoothly. The answer isn’t to simply communicate how difficult it is to manage a system – upper management won’t understand the pain, and therefore won’t listen to the complaints. You have to involve the rest of the team so that they understand what you’re going through first-hand. You get empathy from suffering. This all plays back to Conway’s Law, as Andrew points out: “If you believe Conway’s Law is true (as I do), then you understand that your org structure (who communicaties with whom, who reports to whom, etc.) determines the outcome of any decision.”
Bridget brings up the point that this is the essence of dogfooding: requiring not only your engineers to be in the code, but your employees to be using the products that you’re creating, so that there’s a general understanding of why things work the way that they do, and a buyin for the necessary changes.
Bridget asks Andrew to expand more on what he thinks we are (and should be) optimizing for, which he touched on briefly during his talk at DevOpsDays.
Andrew counters that in order to do that properly, we need to first frame the context, which is a problem that plauges DevOps, Agile, and many other systems with which people are trying to transform their companies – you can’t do something prescriptive until you have enough context to understand where you’re starting from. For example, the diet and exercise program you’d give to someone who’s relatively healthy and active is very different than someone with a different set of circumstances. By the same token, you can’t prescribe a solution to an infrastructure problem without first investigating the roots causes and understanding what the foundation is.
However, if you model the world as everything is an agent trying to maximize some function, then the basic premise of your decisions is cause and effect. “Looking at the way people behave, and how this plays out within organizations, you might have very different patterns of interactions and patterns of health.” Andrew continues, “Therefore, what you’ll tell people do is very different from context to context.”
Despite all of the different scenarios, Andrew argues that there are three things you can always do:
Wondering what the Nash equilibrium and Pareto efficiency game theories are? Here are a few links:
Jeff’s background includes 30 years of experience working in people management. Specifically in developing the management skills, and careers of Software Engineering teams. He now works at Chef as the Director of Learning Experiences.
Just thinking about career development in the workplace is not very common. Jeff discusses statistics around the specifics of Career Development and implementation strategies. For example, in one study, more than 60% of respondents said that Career Development was of “little to no” importance to their current employer. Less than 5% of employees at some organizations surveyed receive any career feedback from their current bosses.
Jeff: There are different approaches people usually take as they grow into their careers. The first is a “go with the flow” approach which can cause issues when leadership skills are not developed in response. A second approach is more proactive in which you plan out and describe your career and where you would like to go in the future.
The panel discusses their own careers and their attempts to summarize “what you do,” their weaknesses, and their strengths. Especially if the description is to be seen by the entire organization. Can you brag? Should you be vulnerable? For some, listing strengths as an engineer can actually be more difficult than listing weaknesses.
Jeff: “It’s a huge act of vulnerability to say who you want to be in an organization”
Jeff: “Every Engineer is responsible for their own career development […] and you are 100% responsible for the career development for every engineer on your team”
The group discusses the merit of titles and how relevant and useful they are within an organization. Jeff: “Titles don’t matter, and they absolutely matter.”
Ultimately, titles should be descriptive of what you want to do in that position and are indicative of positional authority more than anything else.
“It’s Not a Promotion - It’s a Career Change” — Lindsay Holmwood (http://fractio.nl/2014/09/19/not-a-promotion-a-career-change/)
Matt: “I don’t like managing people that’s not my thing.”
Jeff: Performance management is not the same as Career Development. Often, when Software Engineers get promoted to managers, coding becomes a secondary responsibility and People Management becomes a priority. For some, this is not the trajectory they want for their careers, and that’s ok. Providing context for feedback and guidance is a great way to identify these mismatches between current position and where someone wants to be.
Jeff: Using analytics “Project Oxygen” from Google describes 8 qualities of a good manager. ( http://www.nytimes.com/2011/03/13/business/13hire.html )
Jeff: “The Beatle Book” - Ken Schwaber ( http://www.amazon.com/Ken-Schwaber/e/B001H6ODMC )
Trevor: Doing an emotional check-in to describe your current emotional state is a powerful management and communication tool (from the Core Protocols by Jim McCarthy - http://www.mccarthyshow.com/online/)
Jeff discusses the usefulness of check-ins on every level within an organization.
You know you have a good manager when: 1. They expresses real concern with your career development, separate from quality conversations. 2. They create opportunities for you to realize your goals. (or at least get closer) 3. They have your best interest at heart.
…and then there’s the checkouts:
Stephanie Van Dyk @sevandyk is an SRE at Google, and has also worked on healthcare.gov.
Mark Imbriaco @markimbriaco is co-founder and CEO of OperableInc. He’s worked previously at DigitalOcean, GitHub, LivingSocial, Heroku, and 37signals.
Bridget starts by asking Mark what Operable is all about. Mark explains that Operable is trying to help people who are on the “pointy end” of incidents. They’re trying to build tools that help people collaboratively fix problems. “There’s a lot of tools these days that do things like wake you up and alert when you when there is a problem,” says Mark, “but we think there’s a lot of room to help people actually solve problems.”
Stephanie briefly goes through some of the history of healthcare.gov, and how she first learned about it. Her position was unique, she points out. “We worked very hard, and very long hours… We were also in the fortunate position of having a lot of authority, which is important if you’re trying to fix a disaster. There’s a lot of problems to solve, and you don’t want any of your additional problems to be ‘Well, who gave you permission to do that?’”
“That’s a really good point – not all disasters are created equal,” Bridget notes, “and maybe we should take a step back and think, what are the ingredients that make something a disaster?”
Mark: I’m used to catestrophic problems that last for a few hours at most, or in the really bad case, mabe it lasts for two or three or four days, not something that goes on for weeks or months, so that’s a different perspective from what I have, so I’m super interested to hear about [healthcare.gov].
Matt: I think there’s the disasters where there’s a thing that happens, that’s maybe localized to one type of scenario; then there is what happens in an episode of This American Life, where it’s just one thing after another and everything unravels. There’s a quote from the episode that I like to think about when we think about these bigger disasters that are more than just an outage that may be far reaching:
“One ingredient of many fiascos is that great, massive, heart-wrenching chaos and failure, are more likely to fail, when great ambition has come into play, when plans are big, expectations are great, and hopes are at their highest.”
“I think you’re certainly right,” agrees Stephanie. “In order for something to be a disaster, the stakes have to be quite high… An outage that you find, and fix, and write a postmortem, and everyone learns something, and the users all get over their hurt, that’s not a disaster. That’s just life.
At times, there are incidents that leave scar tissue in their wake, making people wonder for years to come if they truly want to use certain products, or trust their data to a certain company. Mark reminisces: “The gutwrenching terror is, are we going to get our customers’ data back, or is it just gone? As an ops person, there’s almost nothing more terrifying than losing data.”
This provides a perfect segue into some of the non-obvious issues that arise with disasters. Stephanie brings up the point that you have to be prepared to regain the trust of your users. “How people think about your service is going to determine the fallout of it, and the impact. It’s interesting – it’s not something engineers like to think about very much, because they simply fix the problem. But someone has to be the one to reassure people that it’ll be ok.”
Mark agrees: It’s really, really hard to be in the middle of responding to a serious problem, and also have to be the person who needs to communicate about that externally. There’s so much good will that can be gained from being as transparent and public as you can about what’s going on, without pulling punches or hiding, even if things are really bad.
This is all well and good, but Bridget brings up a good point: “How do you know exactly how and what to communicate to people?”
Stephanie: There are definitely rings of communication. You have to be able to talk to the other engineers who are working on the problem, and those conversations are going to be very different than how you talk to your customers, even if you’re trying to be super open and honest. Your customers don’t care about where in the logs you found that tiny error. They care about when it’s going to be fixed, and whether you’re actually working on it… Also, the person who’s in charge of solving the outage should not be the same person who’s in charge of communicating about the outage. You should have different roles for that.
Mark agrees emphatically, and also noted that wording is incredibly important – not only what you say, but how you say it, and the words that you use. “There are three things I want to get across. The first thing is, I want to apologize to people. It has to be a sincere apology. The other thing I need to do is make sure people feel confident that I understand what happened. I need to display confidence, and a really firm grasp about the problem. The last thing I need to do is tell them what I’ll do to try to reduce the likelihood of something like this happening again.”
The conversation turns a corner as Matt asks how you plan for outages and prevent disasters. Stephanie jumps in, and reminds us all:
“If you don’t test your backups, you don’t really have backups. Similarly, if you don’t test your outage plan, you don’t have an outage plan.”
She suggests setting up brainstorming sessions with a handful of people from your team, appointing a “DM” (dubbed “Disaster Master” rather than “Dungeon Master” by Tyler), and running through possible scenarios. Keep an eye out for a Kickstarter in the near future ;)
There are definitely advantages to documenting incident reports along the way, but how do you balance the speed of talking through a solution out loud, and the value of face-to-face communication to build trust vs. the need to document things for posterity? Mark: How you interact on a day-to-day basis is also how you should communicate during an outage. The last thing you want to do is change your mode of communication when everything is falling apart and you’ve got high stress.
Matt posits that sometimes what’s a disaster for one company isn’t for another, because of their size, their logistical capabilities, etc., but also, sometimes what is being presented as a disaster isn’t actually all that bad.
Stephanie identifies the first benchmark as determining whether or not your users are hurting. “If they’re not hurting yet, you might have a disaster coming, but I don’t think it qualifies. But if your users are hurting, that’s when you really need to jump on board and get focused.”
Mark agrees, and adds that being able to quantify how many users are affected, and in what way they’re affected, is hugely important. “That’s different than monitoring. Monitoring may tell you that the server’s down, but it doesn’t tell you how many users that impacts.” He reminds us that when you’re working at scale, “services are down for somebody literally all of the time. “What is the threshold where it becomes a disaster? When do you need to start talking about it publicly and in status? Those are questions you really need to answer up front.”
Chris Read, Kevin Hubbard, and Yvo van Doorn are reformed BOFH’s (Basterd Operator’s From Hell).
Chris is back on the podcast again, this time talking about his expereince as a SysAdmin in past lives.
Kevin is currently the DevOps Engineer for BCycle at Trek Bicycle Corporation, and was a SysAdmin for 15+ years.
Yvo previously worked at classmates.com and McGraw Hill Corporation as a SysAdmin, and is now at Chef Software.
Before we get started, you’ll need to understand the origin story of BOFH. There were stories posted on Usenet back in the 90’s, supposedly authored by a computer operator named Simon whose sole purpose was to terrorize the users of his systems. The phrase “How great would my job be if it weren’t for the f***ing users!” resonated with many SysAdmins (and still does!).
Matt starts the episode off asking what it was like as everyone was starting out in this field.
Kevin: To me, it was sort of operating in a scarcity model. You had limited resources, and it seemed like anytime there was a new ask for an application, I immediately went to ‘How am I going to ask for the capacity to run this?’ and I would just get so frustrated. It boiled down to ‘How are we going to support this?’ That was my standard line when someone would bring up something new, and I wish I had trophies to give those people for all of their good ideas, because we just couldn’t get it off the ground with the resources that we had. It wasn’t a fun way to operate, but it was the most realistic view.
Chris: When in high school and university I was the System Administrator for the school systems. It was astounding seeing what damage to the system can be done – how people trying to do something could affect shared resources, and the after-effects of that. Most of the time it wasn’t malicious; it was due to ignorance, but it built up this mental attitude of ‘All users are just there to break things. We need to constrain them as much as possible, because when things break, we’re the ones that get shouted at.’
Matt: There was a belief that devs are stupid! All they’re going to do is break things, because they don’t care about the systems like a SysAdmin does, because they’re ours.
Yvo: Our devs were incentivized not to care because they were paid based on the amount of code they shipped. I’ve had some nightmare evenings trying to fix all of the problems.
Bridget then brought the conversation back around to incentives – are there situations when the incentives are diametrically opposed (or at least not aligend well) between the SysAdmins and the developers?
Matt brings up the point that developers are incentivized to build features, while SysAdmins are incentivized to bring stability, which at its most basic level is maintained by things not changing.
The viewpoint of ‘developers don’t know what they’re asking for’ is also a problem, Kevin reminds us. SysAdmins will often call the developer and explain why things work the way that they do, but won’t take the time to listen to the actual problem.
In reality, there’s a perception of other people touching “our stuff” and things will go wrong, but let’s face it: “there are all sorts of things that can go wrong that are often not a specific person’s action,” says Bridget.
Given that all of us here are supposedly reformed BOFH’s here, let’s chat about how things have changed, and what that process was.
Chris: I finally realized that my interactions with the developers were better if I went to them without the ‘clue stick’ and simply spoke to them, asking them if they realized the impact of their code. It finally clicked for me when I had to work together with the client-side SysAdmins as well as the developers at Thoughtworks. Our whole purpose was to get code written by two different development teams out into production, and it was only through being an advocate for both teams that I was able to build up a relationship with both teams and understand the value.
“It seems like DevOps has formalized the relationship between SysAdmins and developers,” says Kevin. “It seems like a much more natural, iterative process working with devs.” Because we’re working side by side, there’s much less going back and forth with having to figure out the direction and purpose behind projects, and simply getting to collaborate.
The “handing down stone tablets” philosophy not only no longer works… it has never worked!
In Yvo’s case, the change started to happen when the project management team was dissolved. Suddenly, a SysAdmin had to be a part of the development meetings, because there was no longer an intermediary passing information from one team to another. It immediately became more collaborative, and there was visibility into what was happening early on rather than being notified after everything was finished.
We’ve shifted from a mentality of ‘protection’ – teaching our “PFY’s” (pimply-faced youths) to protect their systems against the evil developers – to giving history lessons about how we got to the stage that we’re at now where we need to talk to all of the involved parties, and as Chris said, “having everyone focused on the goals, trying to see things from each other’s angles rather than antagonistcally.” This is how we, as a group, move forward.
“It takes a deliberate decision to shift,” Bridget observes. We have to be dedicated to teaching our PFY’s this new, collaborative way so that in the future, fewer people will start out with this BOFH mentality.
From there, we shift into How can we do better?
Matt asks, “We used to be this way – we’re better now – but what are some of the ways that we can still improve?”
“I want to be able to maintain this new flexibility that comes with DevOps, but I feel like there’s some decision-making that needs to be made as far as tools and standards go,” says Kevin. It’s a matter of balancing the old playbook of limited resources and mixing in the new cohesion and collaboration efforts.
We’ve done a great job of bringing in the greater teams of operations and developers, but most companies still have the one or two lone SysAdmins who are struggling on a daily basis to keep their heads above water. Yvo cautions that we need to bring them into the circle as well.
“If we can make their lives easier, they’re going to eventually go to another shop with the perception that being alone and supporting developers is not a bad thing, but right now they really don’t like life.”
Bridget’s money-back guarantee: “If you’re less BOFH-y, I promise you’ll be happier, or else we’ll pay for your therapy.”
Closing Keynote by Anita Sengupta from NASA JPL was awesome
James Lewis - “How I finally stopped worrying and learnt to love Conway’s Law”
Shannon Smith is the marketing manager for 10th Magnitude, a consulting firm based in Chicago. She’s a self-described “liberal arts nerd” and fell into the tech world through her job at 10th Magnitude. She has a love for well-crafted sentences, song parodies, and general cleverness, which has translated into a love for marketing and the challenges that come with getting an effective message delivered to the “great abyss.”
Jason Hand is a DevOps Evangelist at VictorOps. He’s been in the IT space since college, but it wasn’t until joining VictorOps that he’s ventured into a wider role, working closely with both marketing and product.
Matt starts off this unique episode by posing an important question: “Why is DevOps something that people in marketing would even care about?”
Jason jumps in right away: “Obviously, the VictorOps tool is something we feel falls in line with a lot of the DevOps topics, in terms of the things we try to help engineers with. Being able to speak intelligently to the different subjects that fall within DevOps is very important, especially when it comes to marketing. As most of us know, we’re very averse to traditional sales and marketing tactics; we can sniff out those types of conversations before they even happen, and we’ll pivot and walk away.”
Shannon originally broached the topic during Open Spaces at DevOpsDays Chicago 2014. She was interested in the focus on community, but also intrigued by the jokes about how terrible traditional sales and marketing is. She wanted to know what she could do to combat the negative feelings surrounding these departments, as well as gain insight into what her team could do differently.
The group starts throwing out common pitfalls:
“Even for those of us who are heavily invested in this DevOps thing,” says Jason, “who talk about it on a daily basis, sometimes we don’t even understand everything – all the ins and outs of what is DevOps – because it’s not just a thing. It’s a way of getting things done. It’s much bigger than just one thing. We are very patient and empathetic to that thought process,” he continues, “because we deal with it all the time. That’s part of the challenge within marketing, is how to turn that conversation into a succinct way of explaining it.”
Shannon explains that part of the problem with having a succinct explanation for DevOps, as well as how to sell it, is that there are multiple target audiences. “While we do use more traditional marketing for ‘decision makers’ – people who are busy and don’t have the time to talk about these abstract concepts, we’ve recognized that there are two or three very separate audiences, and we have to make multiple different messages work.”
The point is, your message is going to be very different depending on whether your audience is practitioner, decision maker, or champion. It could be that your company caters to all three, but then you must have separate messages catered to each of these groups. Otherwise you’ll be trying to mandate change from the top-down rather than getting the buy-in from people doing the actual work, or vice versa.
In a conversation with Matt earlier in the week, Steve Pereira said, “I’m in favor of having marketers speak to what they’re actually selling, which is never actually ‘DevOps.’ DevOps is bathwater in which all of these things are babies.”
Given all of these different audiences, Bridget asks the “elephant in the room” question: “When you’re talking about the target markets, how do you build an understanding of DevOps among people who aren’t engineers?”
Jason mentions that at VictorOps, they have an emphasis on moving agiley across the company. Even the marketing team works in sprints, getting things done on a project basis and having team standups. DevOps isn’t just for engineers – it’s applicable for everyone across the company.
But how do we make that happen in a practical sense?
Companies that are doing this successfully often exhibit certain qualities:
This cross-company communication can be difficult at times, but ultimately it’s everyone’s responsibility to speak up if they see problems with the messaging. Bridget offers a solution: “If you’re going to have people external-facing in your organization, going out and evangelizing, they need to also be talking to the people inside the organization, who are developing the product or providing the professional services.”
With Jason’s role as DevOps Evangelist, he’s often out learning from the best. It’s his responsibility to bring the feedback directly back to the company and educate them on what he’s hearing from the community, and likewise it’s marketing and product’s responsibility to be open and willing to hear the feedback.
Collaboration and accessibility is what drives all of this, as Shannon pointed out. Being able to talk to people face-to-face is one thing, but using chat software to check in with the full company, or somehow checking in frequently with multiple teams is key.
For our listeners: If you work in a larger organization, we’d love to hear about how you’ve solved this problem of communication throughout the company. What’s resonated, and what hasn’t? Tweet your answers to us at @arresteddevops.
Main takeaways: Jason - Spend some time researching DevOps. Get to the DevOpsDays events that are happening in your area. That’s where I’ve learned the most. I’ve made some great friends and contacts. Also, these events have helped me have the sense of empathy not only toward the challenges of the marketing and sales teams, but also the conversations that are taking place among all of the business units, no matter what size company. I can now take what I’ve learned, both online and offline, and take those insights and thoughts back to my team. For the time-being, I’m the one internally who’s teaching DevOps best practices. It’s really a matter of absorption. You can’t sit behind your desk and understand DevOps. You really have to get involved with others.
Shannon - Truly the biggest thing that I’ve taken away in the past 9 months is really trying to build up our grassroots messaging and the way that we’re reaching out to people. I think the best way to do that is go to these community events – be involved, listen, share, have natural conversations, and be genuine. Having a presence in the community says a lot more than any little tagline.
Jason
Bridget
Trevor
Matt
Seth Falcon is the Engineering General Manager for Chef Delivery.
Julian Dunn is the Product Manager for Chef Analytics.
Adam Edwards is the Engineering General Manager for the main Chef product.
Matt, Trevor, and Bridget gather at the Chef Newsroom at ChefConf 2015 to talk about the latest and greatest developments at Chef. They are joined by Seth Falcon, Julian Dunn, and Adam Edwards.
One of the biggest announcements at ChefConf 2015 was Chef Delivery. Matt asks what is Chef Delivery, and why is it interesting?
“Chef Delivery is a solution for continuously delivering infrastructure and applications, and it’s built on top of Chef,” says Seth, who’s heading up this new program within Chef. “We’re really excited about Chef Delivery. Successful software organizations use patterns to deliver software at high velocity collaboratively and safely. We’ve been able to distill some of those patterns into a workflow that we think will be easier for folks to adopt and learn.”
Seth continues: “The workflow is one that we’ve seen work, by working with customers to build these types of pipelines over and over again, and find those successful patterns. The overall workflow begins on a developer’s workstation, where they make a change, they do some local testing, and then submit it to the system, where some automated verification tests run. The job of those verification tests is to determine whether it’s worth the time of a human to do some code review on that change. Someone can then do some code review to approve the change. At that point, Delivery will build an asset for us that we could release. The workflow for building that asset is simple: do a merge onto the target branch (usually the master), rerun the same verification tests, which usually consists of unit tests, lint testing, and syntax checks, build that asset, and publish it into a repository where it can be fetched later. Then Delivery will provision an acceptance environment, should you need one, and deploy that asset into that acceptance environment, and run some tests to make sure that the deploy was successful. If it was, it waits there for further instruction. The last step is clicking the ‘Deliver’ button. That sets the system in motion to get the code all the way out. It first goes into a ‘Union’ stage, where if you had a number of projects that had some interactions, you’d be testing them together at their latest version, and making sure they’re good. If those tests succeed, Delivery rolls automatically for the rest of the stages – into a rehearsal environment, and then into a delivery environment.”
Interested in a more detailed description of the workflow? Seth continues to talk through some of the logistics in the podcast as Bridget and Matt ask questions about particulars. You can also check out this video about Delivery:
Seth concludes with, “A lot of what we’re providing here is an accelerant to teams, to give them that system that will allow them to move quickly and learn how to move quickly in that way.”
We transition into asking Adam Edwards, Engineering GM for the Chef product, what’s new in his world.
“There’s a lot of new stuff just coming out over the last few weeks,” says Adam. He goes on to detail just a few new features:
From here, we move into Chef Analytics, and turn the mic over to Product Manager Julian Dunn. He gives us a quick background on Analytics, explaining that it “gives you a way to visualize, query, and report on the events stream that’s coming from your Chef data. Analytics lets you not only visualize that data, but track events in that stream, and then handle them in various ways.”
Matt brings up his favorite part of Analytics, and brings up how it’s closely affiliated to security: “You can take those same audit rules that you write in Analytics, and apply them through your pipeline to test them there. What’s nice is that it’s only one thing to write.” It keeps things simple and straightforward, rather than needing to search through a huge amount of rules and regulations to ensure that everything is accounted for.
Julian agrees: “You can think of security as just another aspect of quality. We understand that you can’t get quality if you try to bolt it on to the back of a system. How many of you have worked with applications that didn’t have tests originally, and the software is just poor quality? We tend to treat security in this backwards way, where we think if we don’t build it into the system, down the road we can just do an audit and we’ll magically get compliance and security. If it’s not a characteristic that’s already there, it’s very difficult to achieve those directives.”
In addition, Analytics allows customers to report metrics and characteristics about your infrastructure to business owners. “One of the things we often hear from customers,” Julian says, “is how do I measure how successful I am at Chef?” If you’re able to use Chef Analytics, it provides you with a direct way to illustrate the ROI of investing in this type of technology.
This business aspect of Chef Analytics was one of the core announcements regarding the Analytics Product Suite at ChefConf, specifically highlighting the integration with Splunk. Julian explains the rationale behind this integration:
Matt segues into an overview of ChefConf 2015 and asks everyone for their insights on this year’s event.
Bridget: One thing that stood out for me at ChefConf is that there’s a very unified community feeling. At a lot of conferences (that are very wonderful conferences!), there are very specific tracks, or specific groups of people who end up mixing more than with others, and at ChefConf, it definitely seems like there’s a very broad community who are all interacting with each other.
Trevor: I’ve been rocking the hallway track… and like Bridget said, everyone’s been fantastic. I’ve spoken with so many people here who I thought would never want to give me the time of day outside of our show, where we kind of have them pinned in a corner (laughs). The Open Spaces in particular were fantastic. Brandon Burton brought up a very sensitive topic: mental health, burnout, suicide… and it was such a breathtaking and emotional experience to hear everyone share personal experiences around that.
Julian: It’s really great that even as we’ve grown as a company and a community, we’ve retained certain attributes about that community. I think one of those is that there’s a little bit of quirkiness, and a little bit of uniqueness, and fun. Backend automation and IT has not necessarily been the most fun arena to work in, and I think this is a breath of fresh air to that sector. I hope we’re able to retain those attributes even as we continue to grow.
Matt: Some of this experience is really hard to explain. You can’t explain what the Las Vegas strip looks like at night, even if you’ve seen it in movies, until you’re there. I’m not saying this is like going to Vegas, but just like no one can describe the Matrix to you, you have to experience ChefConf for yourself.
More information on ChefConf:
Links:
DevOps revolves a lot around what an organization and culture should look like. We talk about it on just about every episode of this podcast. Something we tend to skate around though is the how. How do you change the culture of an organization?
Matt got to sit down and have an incredible conversation with Bill Joy of the Joy Group about the how of influencing change. We didn’t talk about what changes companies needed to make, we talked about how to get companies to make the changes. Bill shared the often overlooked fact that change influence isn’t restricted to top levels of power in a company.
If I were to ask you right now, “What would make your company better?” I’m pretty sure your mind goes into overload with all of the things that you would change if you could. The thing that most people fail to realize is that they can make the changes, or at least influence them.
But it can’t stop there.
In mandating a change, you have to be very aware of how it will affect the company itself; the infrastructure, the culture, anticipate any resistance, who are the key stakeholders are, etc. If you don’t allow for these, your change could be very well unsuccessful and you lose a great deal of credibility as a leader.
Non Executives Can Influence Change Too
You might remember back when we had John Allspaw on our Etsy episode. (Give it a listen, if you missed it.) When asked “How do you implement developments in an organization when I’m not on the top; I’m coming from the bottom of the chain.” Allspaw answered: “I don’t know. I’ve always been in charge.”
What a great position to be in. However, not everyone is a high executive with the capability to mandate change. This doesn’t mean that you can’t influence change.
The question arises then: How do you influence change in your organization from the bottom?
Patrick Debois once said “We should stop calling it DevOps and call it common sense.”
Here’s the problem with that, not everyone is going to see it your way. Common sense is a relative term. If during any of your influencing meetings you find yourself thinking “This is crystal clear to me, why aren’t they seeing it?” that’s more about you than it is about them.
When you make the decision to mandate, or in this case, influence change, it is your responsibility to present your ideas so that people can see them clearly. If they aren’t seeing them clearly, there is a good chance that you aren’t communicating them effectively and didn’t take into consideration who your audience is.
Before you attempt to influence another person, you need to have a firm grasp on who you are. For example, if you’re heavily dominant, your natural approach isn’t going to be effective on someone who is compliant. You have to adjust your approach and presentation to your audience.
Once you understand yourself, know your triggers. As a dominant go-getter, someone who takes a while to process things or lacks energy may really test your nerves in a social situation. Realize this upfront, prepare for it.
From here, you learn to adapt. Adapting is essentially wrapping the package up. Maybe you thought at first that you would have to have a presentation based on making things less structured to allow for more success. But, you realized you’re in a room with the key influencer who is very compliant; he/she likes configuration and organization. You then have to be ready tweak your strategy to make the presentation more effective and geared towards them..
If your client (who happens to be your boss) shoots down your ideas and offers a counter plan, you have to be ready to channel some boldness and stand firm in your beliefs. Instead of arguing back and forth or worse, completely backing down because of his/her position above you in the company, you could say something like “I see what you’re saying, but if we do it your way, this is what is going to happen”, and then lay out how their plan is flawed.
The Compliant One: The compliant person will sign the documents, put their time in, “do their TPS Reports”, as Matt said in the podcast, punch the clock day in and day out. They’ll do it because they need a job, or are too lazy to leave. Whatever the reason, they’ll make the changes, but won’t really care much about it.
The Commitment One: The commitment person will simply, “Believe in the TPS Reports”. They’re dedicated to the idea of change and are committed to the success of the company.
As you have more commitment, the success of your company will be exponential.
As Bill said; “Find satisfaction in the pursuit of commitment.”
The best change influencers are those who don’t see people as something they ‘have to deal with’. They’re authentically interested in them, and enjoy the pursuit of the how of change more than the change itself.
Tonight’s episode featured a special guest host, Julian Dunn of Chef.
Our panel: Ryn Daniels - @rynchantress - a web operations engineer from Etsy. Jake Champlin - @grubernaut - an operations engineer from Minted. (Younger than Trevor!)
Matt asks Ryn what it was like going to a place like Etsy - the gold standard! - what they expected, and what it was like when they got there. Ryn says that years of reading codeascraft and seeing etsy engineers speak at conferences meant that they were excited to start but also felt a little impostory. “Everyone knows that devops is the internet and the internet is cats - I show up my first day and my desk is covered in cat pictures.”
Jake talks about how devopsdaysPGH was his first tech conference, where he got connected to the community and met his future boss Alex Nobert. Julian asks how Jake got connected to Minted, and Jake talks about Bridget introducing him to Alex and how Minted seemed to have a great culture. Trevor asks what stands out about Minted’s culture, and Jake mentions that it’s woman-owned and has a strong focus on the product, and how everyone’s motivated to make a quality product and it’s reflected in your infrastructure.
Bridget suggests this might be the inverse of Conway’s Law, while Matt asserts that most “inverse Conway’s Law” discussions just end up proving it. Jake mentions how the community of Minted votes on some products, and the engineering team tries to bring that same excellence throughout.
Bridget asks Ryn how Etsy’s engineering practices influence or are informed by the culture of the company. Ryn mentions Etsy’s B-Corporation status, which means they are focused on social good, transparency, giving back to the environment - and that’s reflected in a lot of what they do, from codeascraft to blog posts to involving sellers in the community.
Both guests give a short version of what their company does; Ryn says that aside from the monitoring and sending everyone to Velocity, Etsy is the largest marketplace for handmade and vintage goods. When Jake explains Minted, it’s revealed that as “lazy podcasters”, we didn’t realize that Minted was as artist-focused as it is, providing artists a platform and community - Bridget thought it was primarily for paper goods, while Matt thought it had something to do with money.
Bridget asks Ryn about their decision-making process to decide that Etsy was right for them. They said that even though Etsy folks they met at Velocity didn’t know them, they were welcoming and never condescended to them. (Also, pink-haired thought leadership.) They also admired how Ian Malpass was mentioning at devopsdaysMSP how he wanted to put together a class for effective male allies inside Etsy.
Similarly, when Jake went to devopsdaysPGH, the way people were open and welcoming is what made him know he was in the right place. (A restaurant-related digression led to a shout-out to Jon Cowie’s knife-spork.)
Matt mentions having the feels and being excited about getting a chance to work with Julian Dunn! and how when starting a new job, when you feel intimidated, people in a good culture are going to reach out to you and make you feel welcome. Julian asks our guests what the experience of starting at one of these devops gigs was like.
Ryn talks about how starting at Etsy is less-stressful than smaller jobs in the past (since payroll isn’t an open question) and then mentioned the engineering rotations that allow new people to “bootcamp” with other teams, to increase understanding across the site. They also mentioned the “first push” program, where everyone (even non-engineers) learns to deploy a change to the site.
Jake’s working remote, and his onboarding process was like being thrown into the deep end of the pool. He felt impostor syndrome at that point. Bridget agrees that it can be an intimidating feeling, and asks Jake how he deals with that. He says that he’s working with really smart people and is happy to learn new things every day.
Trevor asks about cultural cues. Ryn says that they have a chat-heavy culture, and because there are so many remotes, even the in-person team will communicate as if they are remote, so you get to know what’s going on with the team and what their favorite cat gifs are. Trevor mentions introducing hipchat to a client and how it caught on quickly. Jake mentions that although only ops and qa are remote, everyone at Minted uses hipchat. Matt believes it’s frustrating when there are people in an org aren’t on chat, and Jake agrees that having the whole org there (HR/payroll, etc) makes communicating easy, even as a remote employee.
Ryn: “I do get to sit ten feet away from John Allspaw, so I’ve got that going for me.” Matt: “If you’re playing the Arrested DevOps drinking game, that’s a drink.”
Julian asks what are some of the downsides of a chat-heavy culture. Bridget mentions that at DramaFever, people have sometimes found chat to be distracting, if people are wanting your attention all day, and Ryn points out that at Etsy, it’s considered okay if someone needs to be heads-down working on something, either turning off notifications or signing out of chat. Trevor mentions that working with a global team can mean unintended awakenings from 3am @-mentions, while Ryn and Jake are both in cultures that encourage not having chat on their phones. Bridget says that at DramaFever, the solution is spaces in someone’s name so as to not alert them during off hours.
Jake talks about how being in an oncall rotation instead of being oncall 24/7/365 is great, and after a week of oncall, at Minted they get the next Friday off (and their co-workers will kick them out of chat if they join). Bridget asks Ryn about their blog post on work-life balance, and Ryn says they wrote that about disconnecting, and Etsy also has a culture of asking people to take care of themselves (where they threatened to take away their VPN access because they were working while sick).
Julian, who reveals he is stranded in Chicago and that’s why he is on the show, turns the conversation back to finding good roles and asking what role a recruiter plays. Ryn says that an Etsy person being oncall and troubleshooting during a Sysdrink meetup in New York attracted a number of applicants. That, codeascraft, speaking at conferences - showing what an org does, instead of just saying “we’re hiring” - works better.
Bridget asks them what makes a job a place they know is right for them. Jake says the culture, and working on interesting things with smart people. Ryn agrees, and says that their first week at Etsy they got to start contributing right away. Bridget points out that new people have a power that people who have been there longer can never get back - the power of not knowing how things are “supposed” to be (and the ability to write documentation to better serve a “don’t know the answer” POV). Ryn says that breaking Nagios Herald their first week (and then they also got to experience Etsy’s culture around blamelessness.) Jake is in a more greenfield situation, and so he was able to start contributing immediately out of necessity.
Bridget asks for the panel’s best advice for people who are interested in a “devops” job, want to find such a job, etc. Apparently the answer is having Bridget introduce you to people and/or convince you that you’re totally good enough to work somewhere. [Note from Bridget: this may not scale.] Both Jake and Ryn point out that connecting with the community on Twitter is a really valuable place to start making connections. Matt points out that liking your co-workers isn’t so much “nepotism” - you don’t have to party with them - but you spend a lot of time with your co-workers, so you’re going to want them to be people you don’t hate. Matt says, “It’s not know the ‘right’ people, it’s just ‘know people’.” Jake points out that at PGH, Mark Imbriaco and Ben Rockwood spoke to him as if he was a friend, even though they are prominent in the community. Ryn encourages everyone to use Twitter.
James Turnbull describes Docker as “a solution that is built by people to be usable by people, as opposed to some of the previous containerized solutions which were built by engineers to be usable by a very small subset of other engineers”.
When you might want to fly less… “when you start to recognize the airport lounge staff and the flight attendants on the New York-San Francisco route and they start to recognize you.”
James wrote The Docker Book to allow people of varying skill levels to quickly understand how to use Docker and what the practical applications could be for them. It’s intended to be a practical how-to guide.
At Kickstarter, developers have a dockerized replica of production on their laptops.
Matt asks if Docker can be only used in completely new deployments designed for Docker from the ground up. James points out that if you have existing infrastructure tools, it’s simple to create Dockerfiles from them.
The night before 1.0 launched at the first Docker conference in mid-2014, James removed all references to “don’t use this in production” from docker.com.
James mentions that Fig (soon to be renamed to Compose) helps with modeling multi-tier architectures locally.
James says, “People kinda forget the past and go, “oh my god Docker’s a pain in the ass to use”, and I’m like “compared to what, exactly? Compared to your previous build, or compared to you shipping around 10 ISO files and running Vagrant and 20 VMs on your local machine?”
He continues, “It wasn’t that long ago that the dark ages were real. I’m not suggesting that Docker’s a panacea, but it’s certainly a step in the right direction.”
James points out that something like Elasticsearch does well in Docker, since “it’s a bit of a fiddly thing to build, with the right version of the JVM, right version of Elasticsearch, prepping all the data, etc”.
James highlights continuous integration as a “sensational combination” with Docker.
On the controversy, James points out there will always be hype and people claiming “this is a revolutionary technology that will cure world hunger”. He says, “I’m fond of saying that Docker is a powerful tool to help you in your development life cycle […] not every workload in your data center is well-suited to Docker.” James doesn’t make technical architecture decisions based on the writing of tech journalists or blog posts, but rather by testing and evaluating the relative merits of a given solution.
In the case of Graphite, James would run carbon-relay and carbon-cache inside Docker containers, but he’d point them at a physical machine with SSDs to actually write the whisper files.
Matt read a blog post and reactions on reddit and wanted to see what James thought of the concerns around security and operability. James points out that empathy for developers is something sysadmins need to cultivate, because you don’t manage infrastructure for infrastructure’s sake.
James points out that the main reason developers ship code that doesn’t work in production is that they have no fucking idea what production looks like because there’s this grumpy asshole that manages production and they’re terrified to go ask them a question. Bridget says that as such a former grumpy asshole, she’s much happier when the devs aren’t afraid to talk to her.
James mentions that Docker containers are not virtual machines and should not be used to separate security concerns, and you should secure the host the containers are running on.
Matt: “I’m not suggesting that this [security concerns] is why DOCKER BAD…” Bridget: “Race conditions with devicemapper is why DOCKER BAD.”
James: “[PCI/DSS] is a low bar. If you followed simply the regulations for the compliance stuff that related to PCI/DSS, you would be running a massively insecure system.”
James points out that “owning” the standard gives one access to the marketing around an ecosystem. He also thinks that even if Rocket is a better technical solution, Docker has more traction.
Bridget: “So when I feel ranty about Docker and devicemapper, I should submit some pull requests.” James: “You should talk to Michael Crosby… Michael Crosby is currently in San Francisco somewhere going you motherfucker.”
James sees Amazon and Microsoft’s embracing of Docker as a great driver of revenue towards these cloud providers, if it gets developer code to production faster. They aren’t following hype; there are transparently obvious business reasons to do it.
In terms of skating to where the puck is going to be, James suggests looking at orchestration, software-defined networking, software-defined data centers - people building that sort of thing with Docker components. Docker Compose, Docker Swarm, people moving up the stack to manage different levels of abstraction.
James: “I challenge you to find a LAMP stack site where 80-90% of the configuration files aren’t identical - our secret knowledge of what to tweak isn’t as valuable as we think it is.”
Dave is at Next Big Sound, which does analytics for creative industries, and he’s seen a few orgs handle failure well, and a lot of organizations handle it poorly. He got interested in blameless postmortems and human factors in discussions with John Allspaw of Etsy, and Allspaw influenced him to read the work of David Wood and Sidney Dekker on human factors. He is writing a book for O’Reilly called Being Blameless.
MCR works at Etsy now, but has spent a lot of time consulting at various firms where he’s seen failure handled with blame. He points out what Rt. Lieutenant Colonel Scott Snook said in Friendly Fire, a book about when two US helicopters were accidentally shot down, that failure is part of complex systems.
MCR: “I work at Etsy, and that’s what we do - we examine failure as a learning opportunity.”
Dave is running his next workshop on Awesome Postmortems in NYC on February 12th, in which
Dave: “Sidney Dekker’s Field Guide to Understanding Human Error is probably the most important book for people like us, meaning people that are in the IT world - it’s very accessible and gives lots of examples from fields outside of IT, but they’ve very relevant to what we do.”
MCR: “Failure is gonna happen. It’s not a matter of if something is going to fail, it’s a matter of when it is going to fail.”
MCR mentions the different categories of failures - those that “fail closed”, that are easy to detect, like disk filling up, and “fail open” - the surprises. He mentions some of the techniques Etsy uses - an IRC warroom, Vidyo video chatting, to resolve an immediate issue. After the immediate issue is solved, the learning begins.
MCR: “We celebrate failure as much as we celebrate success here. […] The three-armed sweater is given to the person who most spectacularly impacted the website in the year.”
On the topic of why to do a blameless postmortem, MCR points out that it’s for learning, and there are both technical and human factors. Dave points out that blaming a person short-circuits the learning. Claiming that a person is the cause of the outage feels like a good story, but it’s not true.
Dave discusses root cause and mentions Allspaw’s excellent blog and a specific post about there being no such thing as a root cause, and Dave disagrees. He believes that outages are caused by change, and the systems with which we work are fundamentally changeable. “The impermanence of systems is the reason that they both function and malfunction.” Mike counters by saying, “Is there really a root cause for something that failed? If a hard drive dies, it’s the same hard drive. It hasn’t changed.” They both agree that it’s a philosophical rabbit hole.
MCR notes that as Etsy grows, they’ve found that user-impacting, service-degrading issues are when they do postmortems, and even if not user-impacting, if they can learn from a failure it’s worth doing one. Dave says, “The more we learn about the complex systems within which we work, the better we’re able to operate them.”
Within a week or two, according to Dave, is common practice of a time in which do the postmortem. MCR mentions that it’s important to write down the timeline almost immediately, definitely within a day or two, but doing it while someone’s amygdala is still triggered (and they are upset) is too soon. Dave points out that the facilitator of a postmortem sets the tone, including reminding people of hindsight bias, and at Next Big Sound they use a specific framework document which Dave will share. He also mentions defusing stress with empathy and humor.
On the topic of evaluating anything you do, MCR mentions that Etsy created Morgue because any department across Etsy can apply these techniques to learn. Dave points out they do retrospectives as well as prospective review at Next Big Sound. MCR says Etsy does both an architectural review and an operability review ahead of time. Dave mentions that answers in prospective reviews can be biased in a positive way, whereas in a “premortem” we imagine things going badly, and try to determine what could lead to that: in essence, harnessing hindsight bias to work for us.
Bridget forgets what decade it is and claims to have seen a presentation at devopsdays 2003. That would have been a nifty trick, since the first one was in 2009. :)
Microsoft is open-sourcing .NET and creating the CLR for Mac and Linux .Net core5 is the new framework for 5, you can ship your own version of the app.
There is now a free version of Visual Studio — Visual Studio Community for open source developers and students. In fact, it is all happening out in the open on github.
Emma Jane is a long time listener of ADO! She has been teaching version control for many years with specific emphasis on the communication behind version control in teams. She has since switched to distributed version control such as git. Her aim in her teachings is to create resources that make git ”less painful” than it currently is.
Emma Jane:
Matt: Many people hold the theory that you cannot have “The DevOps” without distributed version control. It implies communication through teams, so what is the validity of that statement.
Emma Jane:
Matt:
Emma Jane:
Trevor:
Emma Jane:
All kinds of people are interested in learning git. But mainly:
In order to identify how your team will most efficiently use git, draw out your team flow and identify where efficiency is being blocked. Is re-basing causing problems? Is a PR sitting out there for too long? Use those as discussion points with your coworkers.
You cannot introduce creativity when you are just told to memorize commands.
Emma Jane: - Use Interactive Add! It allows you to split up your diffs into different commits. So you don’t end up committing a huge chunk of features that should most-definitely not be committed together.
Look for the right git-flow based on the type of deployments you are going to be using to release the software. Are your deployments feature based? or time based? How important is a rollback?
Your code should always be deployable in a CD framework. You are only rolling forward, you have one master branch, and feature branches, how can you have correct and fast CD if you have multiple branches before the CD process starts.
Your git setup should be directly related to your infrastructure. The git releases and flows of a team of 1 is going to be massively different than the git flow of a large team for a Could Provider.
Rebasing:
Git Bisect
Source of Emma’s talk about Git: http://github.com/emmajane/gitforteams Post version: http://24ways.org/2013/git-for-grownups/ Recording: http://prague2013.drupal.org/session/git-makes-me-angry-inside
Emma’s rant about storing the history of your project: http://gitforteams.com/resources/evolution-social-coding.html
GitHub conversations: http://guides.github.com/introduction/flow/
Riffsy ios8 keyboard for animated gifs
You Suck at Technical Interviews
John Vincent - DevOps The Title Match
Encourage extension Visual Studio
What exactly does it mean to be a sysadmin? How do you define “sysadmin”?
What is your favorite thing about devops? How does it change your life as a sysadmin?
Matt has a belief that sysadmins are inherently cynical. Is he an idiot?
What do you miss about “the good old days?”
Alex Howells asks: “How often do you think ‘Fuck it all, I’m going to be a plumber or electrician?’"
Video of guys changing tires while car is on two wheels http://www.youtube.com/watch?v=5WLwRg3erm4
Amazon Certified Sysops Admin Associate http://aws.amazon.com/certification/certification-levels/certified-sysops-admin-associate
What is software delivery? There are a lot of approaches to this subject- what does “software delivery” mean at PagerDuty?
What is your idea of “best” way to deliver software, or line of best fit?
What gets in the way of companies or individuals delivering software?
-How do you mitigate and test for problems introduced by code changes? -Deployments? -Dependency issues? -External factors?
What are some patterns and anti-patterns for consistent software delivery?
On system rollback and totalised fields by Mark Burgess
The Company Management Beliefs DevOps only works for startups or web companies. DevOps doesn’t scale. You can’t do DevOps without being Agile
Business Semantics Shops practicing DevOps should have a DevOps team. We can’t do DevOps because we need separation of duties.
The Team Operations Assumptions DevOps means “developers do operations work” A DevOps is a sysadmin that uses config mgmt. DevOps is about hiring sysadmins who code.
Developer Expectations DevOps means developers get admin access in production Developers cannot be trusted.
The Tools DevOps only works with Open Source tools and operating systems (i.e., I can’t do DevOps in a Microsoft shop) The tools promote the DevOps cultural change.
The wrap up - DevOps doesn’t work.
Want to learn more about Configuration Management? Check out The Food Fight Show podcast!
A Game of Thrones: The Graphic Novel: Volume One
Things get off to a great start during the retro, where Trevor complains about destroying his USB 3.0 drivers due to a Win 8.1 upgrade and Matt turns into a Cylon.
Test Kitchen is now officially 1.0, but does’t really support Windows, but that doesn’t stop Matt from wanting to hack it to make it work anyway.
“Testing is any action taken to give you information about the actual state of your software, vs your assumptions” – Lanette
“A lot of developers look at testing like insurance – it’s not going to prevent a disaster, but it’s going to help you mitigate those problems” – John
Spirited discussion about the value of code coverage as a metric, and our panelists mostly violenly agree that it is not a valuable number in a vacuum. We also discuss that it is possible to approach all of life like a QA tester.
DevOps – the Title Match – John Vincent’ blog post about what DevOps is and isn’t, that Matt totally read from and referred to.
Food Fight Show – The Podcast where DevOps chefs do battle. These guys totally watched this live. Which is more than I can say for The Ship Show. Just kidding.
En liten tjänst av I'm With Friends. Finns även på engelska.