ELC
Developer Productivity & Effectiveness
# DevTools
# Roundtable

Reliability Best Practices. SRE Lessons for Engineering Teams

During the online discussion on "Reliability Best Practices," engineering teams explored strategies to enhance reliability through data collection and analysis. The conversation highlighted the importance of observability tools like Sentry, Datadog, and Chronosphere for collecting logs, metrics, and traces. Participants also discussed managing alerts to reduce alert fatigue and the significance of prioritizing reliability through proactive incident response and addressing technical debt. The aim was to guide engineering leaders in implementing best practices for creating more robust and dependable systems.
During the session, we gained insights into managing risks without rollback options, managing the costs and complexities of migrations, and maintaining observability and security.
# DevTools
# Roundtable
56:10
In this presentation, WireMock demonstrated their API developer productivity platform, addressing common challenges like API dependencies that slow down development. The demo highlighted how WireMock enables parallel development by allowing teams to create mock APIs, letting front-end and back-end teams work simultaneously, even before the actual API is ready. Community members can see how this approach speeds up iteration and improves team collaboration. The session also covered advanced features such as dynamic responses, stateful mocking, and validation, making it easier to maintain realistic and efficient development environments. More info: "WireMock is an API Developer Productivity Platform. They help developers and testing teams to overcome challenges related to dependencies on unstable third-party APIs, limited sandboxes, or internal APIs that delay their work. By enabling developers and testing teams to mock/virtualize these APIs dependencies, they can create isolated environments which allows them to deliver faster, reduce time to market, increase velocity and product quality throughout their development cycle."
# DevTools
12:30
In this presentation, Crafting introduces its cloud-based development platform aimed at simplifying complex development workflows. Crafting enables developers to work entirely through a browser, offering a seamless, cloud-hosted environment with features like a web terminal, web-based VS Code, and integration with local tools. The platform allows for standardized environments, supports complex setups with microservices, and facilitates debugging and testing through instant changes. It also provides shared staging environments, where developers can safely test and debug their services without disrupting others. Crafting aims to streamline the development process, allowing for quicker iterations and increased productivity. More info: "Crafting Inc provides the innovative development platform as a one-stop solution for engineering teams to work productively in simplified, rapid-replicable environments at low cost."
# DevTools
8:40
Alerty showcased a monitoring platform tailored for frontend teams, filling a gap in existing tools that focus mainly on backend needs. Unlike traditional solutions like Datadog, Alerty offers user experience monitoring and error tracing for frontend applications. The demo highlighted Alerty’s automatic alert setup and AI-driven troubleshooting, helping teams quickly resolve issues like typos and DNS errors. Integrating with frontend technologies like Next.js, it provides detailed insights into performance and user interactions, emphasizing ease of setup and actionable alerts. More info: "Alerty is an AI SRE that can monitor your app, answer questions, send intelligent alerts, and automate issue resolution."
# DevTools
13:18
Resourcely showcased its solution aimed at simplifying the work of infrastructure and DevOps teams. In the modern DevOps landscape, developers often face hurdles when managing cloud infrastructure, such as configuring S3 buckets or setting up databases. Resourcely addresses these challenges by providing two core components: blueprints and guardrails. Blueprints offer developers pre-configured patterns for cloud resources, while guardrails ensure compliance with best practices. By integrating these elements, Resourcely enables developers to move faster and stay focused on coding, while automating reviews and approvals for risky configurations. The session emphasized how Resourcely aims to boost productivity without compromising security or configuration standards. More info: "Resourcely helps engineering leaders ensure reliability, scalability, availability, and security of cloud infrastructure by enabling developers via self-service. Our customers can quickly set up golden patterns (blueprints) and guardrails (desired configuration settings) to guide developers towards correct configuration. Resourcely creates standard Terraform that works with customers' existing tools and pipelines."
# DevTools
12:29
Bacca introduces its AI-powered SRE, designed to relieve engineering teams of operational burdens. The speaker discusses the challenges of managing incident response and system reliability, highlighting the frequent firefighting that hampers productivity. Bacca aims to change this dynamic by leveraging AI to autonomously analyze incidents, gather historical context, and suggest actionable solutions, all within familiar tools like Slack. The demo showcases Bacca’s ability to provide insights and streamline incident management, ultimately enabling teams to focus on innovation rather than maintenance. For those interested in enhancing their operational efficiency, Bacca invites viewers to explore a partnership for integrating this AI solution into their engineering workflows. More info: “Bacca is the first AI-powered SRE designed to own your on-call shift. Seamlessly integrated with your existing operation tools such as Datadog, PagerDuty and Slack, bacca jumps into action at the first sign of trouble, forming triage plans, conducting investigations, and arriving at a trustworthy root cause analysis. Just like your most experienced engineer, bacca leverages historical operational experience and institutional knowledge to troubleshoot incidents with precision, all while maintaining the highest levels of security and privacy. Bacca is the perfect partner for growth-stage engineering teams experiencing operation challenges, reducing incident MTTR and freeing up your valuable engineering resources so you can focus more on innovations over operations.”
# DevTools
12:47
Daytona showcased its development environment manager designed for flexibility and security in engineering workflows. The demo highlighted Daytona’s compatibility with various repositories, IDEs, and code assistants, enabling developers to personalize their setup easily. With robust security features, Daytona ensures compliance with governance requirements while facilitating efficient collaboration among team members. The open-source component allows individuals to explore its capabilities, while the commercial version is tailored for teams seeking to standardize their development environments. Overall, Daytona aims to simplify the development process, allowing teams to focus on innovation rather than infrastructure. More info: "Daytona is an open-source Development Environment Management platform that enhances developer productivity and experience. By automating the entire setup process, Daytona creates unified and standardized development environments, reducing the burden on your DevOps team by handling the orchestration of these environments."
# DevTools
8:50
Vantage showcases its innovative cloud cost observability and optimization platform. Designed for users of major cloud services like AWS, Azure, and Google Cloud, Vantage enables organizations to efficiently manage and optimize their cloud spending. The platform offers comprehensive insights into cloud costs, enabling users to track expenses, forecast budgets, and implement financial optimizations. Key features highlighted include customizable dashboards for monitoring costs across multiple providers, detailed reports on resource usage, and advanced tools for Kubernetes workload management. Vantage also supports anomaly detection, helping teams identify and address unexpected cost increases. This session emphasizes the platform’s self-service capabilities, making it accessible for teams to manage their cloud expenses effectively. More info: "Vantage is a cloud cost observability and optimization platform for AWS, Azure, GCP, Datadog, Github and 10 other cloud service providers. The company has raised $25M from a16z and Scale Venture Partners and currently assists 12,000 organizations ranging from startups to F500s manage billions of dollars of annualized cloud costs. More details can be found at www.vantage.sh"
# DevTools
10:04
In this presentation, Superlinked explores the challenges of building vector search systems tailored for complex data. The speakers emphasize the importance of effectively managing various data types beyond simple text, such as numbers and structured information commonly found in databases. The session highlights practical solutions for creating custom embedding models that enhance data ingestion and querying. Attendees can access valuable resources, including an open-source database comparison table and GitHub repository for hands-on experimentation. The talk aims to equip community members with insights and tools to harness vector search technology for real-world applications. More info: "Superlinked is a framework and soon a cloud service that helps AI & Data teams build vector embedding-powered software across RAG, Search, Recommendation Systems and Analytics. It is specifically focused on constructing custom data & query embedding models from pre-trained components, cutting down on time-to-market and the amount of compute required for evaluation and for production."
# DevTools
13:55
Ian Nowland · Sep 13th, 2024
Slides available here: https://docs.google.com/presentation/d/1gn6Ru0nunSjw0OqI0O3iP2LSn5gbgfWTcB6iyX9z9fY/preview#slide=id.p
No one quite knows what "Platform Engineering" is, let alone why it is now becoming a best practice. On the one hand, some vendors claim that all you need is their developer portal and 4 weeks of integration. On the other hand, DevOps engineers and SREs say they have been doing it all along and that it's just a management fad. In this talk, I will cover what I learned about the "what" and "why" from co-authoring a book on Platform Engineering, helping those new to the term evaluate whether they should be doing it and giving those already practising the context to better justify their value to customers.
35:52