
5 ways to dramatically speed up your cloud application teams

· 11 min read
Keith Howling
Apps in the cloud

Working with application teams and partners developing cloud native apps on Azure, you quickly learn that developer time is valuable, and that enthusiasm and flow state are critically important.

Over the years, I've seen so many application teams have to wait: for an environment, for access, for a ServiceNow ticket, a support case, or admin rights to install tooling. Productivity is dramatically affected by interruptions in flow state; projects can take twice as long and be of lower quality.

Equally, it's important to have a designed, governed, secure environment when using the public cloud, so your workload teams start right and stay right! There is excellent documentation on this in the Cloud Adoption Framework. This blog, however, looks specifically at the needs of application teams developing cloud native solutions.

Application teams work best when they can select their preferred platform services, tooling, languages and libraries, and, most importantly, reduce their dependencies on external requests and constraints that interrupt their workflow. To this end, a priority for any Platform Team (the team providing the environment to application teams) should be to work towards a self-service model, removing themselves from the process wherever possible and constantly unblocking friction points.

Where Platform teams should focus

You don't need to have everything automated from day one, nor have tooling for everything, but focusing on these five crucial elements will deliver more impact for everyone in your organization, avoiding the unnecessary interruptions and bottlenecks I have seen depressingly often.

1. Environment Provisioning

When an application team starts work on a new product, timely access to an environment matters, while enthusiasm is high. Aim to give teams access to a fully operational environment within 30 minutes. This typically means vending a Resource Group with all the access they need to immediately start deploying their solution designs (more on this later). The resource group naming, tagging, the subscription sharing model and the level of access can all be determined based on the environment requested.

A simple subscription sharing model can be designed to avoid too much subscription sprawl, based on department or other organisational commonalities, but subscription separation should be enforced on workload type and connectivity (more on this below). You then end up with a subscription list that is meaningful and manageable, with resource groups that are named consistently. There are a number of good resources on naming recommendations.
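As a sketch, a convention like this can be baked into the vending automation. The rg-&lt;department&gt;-&lt;app&gt;-&lt;environment&gt; pattern below is a hypothetical example, not an organisational standard:

```shell
#!/usr/bin/env bash
# Hypothetical naming convention for vended resource groups:
# rg-<department>-<app>-<environment>, lowercased so names sort and
# filter consistently across the subscription list.
rg_name() {
  local dept="$1" app="$2" env="$3"
  echo "rg-${dept}-${app}-${env}" | tr '[:upper:]' '[:lower:]'
}

rg_name Contoso-Retail checkout dev   # -> rg-contoso-retail-checkout-dev
```

Generating the name in one place like this means the vending pipeline, tagging policy and dashboards all agree on what a resource group is called.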

tip

Subscriptions in Azure can support hundreds of developers: they have granular role-based controls and mature cost-tracking services, and you can track subscription limits and usage very effectively. We recently had 300 developers across 35 resource groups, deploying resources across the globe, all working happily in a single sandbox subscription.

During the vending process, it will be important to capture:

  • Workload Type (e.g. Production, Non-production, Sandbox): This determines the policies that are applied to control what can be deployed and the levels of access, and it keeps clear separation between these environments. Production and Non-production/Sandbox workload types should be in separate subscriptions; however, a single Sandbox subscription can support many teams' resource groups.
  • Required Networking (e.g. Connected or Non-connected): This determines if private IP connectivity is required by the workload, or if ingress/egress to the workload needs to be privately routed.

The first, simplest, most unconstrained environment you should offer is a Non-connected Sandbox. This gives application teams the most flexibility to experiment with multiple services, with full access to the environment in the portal, so the team can rapidly take ideas to the POC stage. Here there are typically few or no restrictions on access or on the resources that can be provisioned. The most constrained, and most complex, environment will be a Connected Production subscription: this will have policies to ensure production guardrails are followed, plus networking to allow private IP connectivity and ingress/egress routing controls (if needed).

A typical application requires the team to request three environments: starting with dev, then test (both non-prod, and they can share a subscription), and finally production, which would be created in a new subscription, each environment progressively more restrictive.

tip

The new Subscription Vending Bicep Verified Module is an excellent starting point for vending these environments, from the simplest to the most complex, with a module and parameter driven approach. You can collect the required information from the application team, then call the vending module directly from the az cli to start with, or create a pipeline/action in your favourite DevOps tool; you could even trigger a GitHub workflow from an Issue template.
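To illustrate the CLI route, a minimal invocation might look like the following. The management group, location and parameter file are placeholders; the module's actual parameters are documented in the sub-vending module repo:

```shell
# Sketch: deploy a template that wraps the subscription-vending module at
# management-group scope. All names and paths here are placeholders.
az deployment mg create \
  --management-group-id "mg-landing-zones" \
  --location "uksouth" \
  --template-file main.bicep \
  --parameters ./params/team-a-sandbox.bicepparam
```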


tip

Hot Take #1: I'd recommend Bicep over Terraform when automating environment provisioning or application deployment on Azure, even if you are multi-cloud. It's a simple, powerful, performant, first-class experience without the complexities of a state file: the state is whatever is deployed in Azure, and templates can be re-run with only the changes being applied.

2. Environment Permissions

So you have vended an environment, and the app team has tried to provision their first internally authenticated webapp that calls a gpt-4o model using identity-based access, deployed using GitHub Actions... error, error, error: four tickets in five minutes. Now the team is googling for workarounds instead of delivering their project, wasting valuable time and enthusiasm. What's the problem?

When building cloud native apps, managed identity and role-based access are a crucial part of the application architecture, and 100% the best and most secure way of creating cloud native applications.

Platform teams must provide the appropriate level of access to the application team to allow these solution architectures. I've seen this single thing waste tens or hundreds of hours of skilled people's time.

Recommendation #1

When assigning roles to the application team on non-production/sandbox resource groups, Contributor is not enough to create identity-based solution architectures! Consider giving the team Contributor plus Role Based Access Control Administrator; the latter can be scoped to the resource group, and can be further limited to only assign selected roles to selected principals.
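As a sketch, that constrained assignment can be created with an ABAC condition on the assignment itself. The group object ID, scope and role GUIDs below are placeholders:

```shell
# Sketch: grant RBAC Administrator on one resource group, constrained so the
# team can only assign the roles listed in the condition. The object ID,
# subscription, resource group and role-definition GUIDs are placeholders.
az role assignment create \
  --assignee "<app-team-group-object-id>" \
  --role "Role Based Access Control Administrator" \
  --scope "/subscriptions/<sub-id>/resourceGroups/rg-team-a-dev" \
  --condition "((!(ActionMatches{'Microsoft.Authorization/roleAssignments/write'})) OR (@Request[Microsoft.Authorization/roleAssignments:RoleDefinitionId] ForAnyOfAnyValues:GuidEquals {<role-guid-1>, <role-guid-2>}))" \
  --condition-version "2.0"
```

With this in place the team can wire up managed identities to databases and queues themselves, but only using the roles the platform team allow-listed.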

Recommendation #2

Ensure resource provider registrations have been done as part of the vending process, so they don't block the application teams from creating their resources.
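For example, a vending step might pre-register the providers a typical cloud native app needs; the namespace list here is illustrative, not exhaustive:

```shell
# Sketch: pre-register resource providers during vending so the team's first
# deployment doesn't fail with a MissingSubscriptionRegistration error.
for ns in Microsoft.App Microsoft.ContainerService Microsoft.CognitiveServices; do
  az provider register --namespace "$ns" --wait
done
```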

Recommendation #3

Many internal applications will need end-users to authenticate with your corporate Entra ID tenant, the directory that protects all your Microsoft and 3rd-party apps. These apps need application registrations. If your organization blocks the self-service creation of new application registrations, and/or has restrictive consent granting, ensure the team knows the process for requesting a new application registration. Also, unless you want a new ServiceNow ticket every time the app team wants to add a new callback URI, consider adding the application team as an owner of the app registration.

Recommendation #4

For production deployments, look to remove any reliance on individual employee identities by using automated workflows that use a Federated Identity Credential. Similar to Managed Identity, this avoids having to vault and rotate client secrets in your DevOps platform (GitHub/ADO); see Use GitHub Actions to connect to Azure.

Federated Identity Credentials can now be provisioned directly in your vending process, using the new Bicep templates for Microsoft Graph. This will help the workload teams start right and stay right!
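As an illustration, a federated credential for a GitHub Actions workflow can also be created from the CLI; the app ID, org, repo and branch are placeholders:

```shell
# Sketch: trust OIDC tokens from one repo's main branch, so the workflow can
# log in to Azure with no client secret. APP_ID, org and repo are placeholders.
az ad app federated-credential create --id "$APP_ID" --parameters '{
  "name": "github-main",
  "issuer": "https://token.actions.githubusercontent.com",
  "subject": "repo:my-org/my-repo:ref:refs/heads/main",
  "audiences": ["api://AzureADTokenExchange"]
}'
```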

3. A little less documentation & a little more sample repos

Environment provisioning gives the application team a blank slate at this stage; it doesn't make any assumptions about their solution architecture. This allows the team to select the optimal services for their use-cases, whether that's a microservices app, an integration workflow, or a simple static webapp.

Selecting the appropriate service for the use-case makes the best use of the public cloud, optimizing your costs while minimizing the operations required to support your application. Equally, it doesn't assume the structure or number of repos that the application team will use.

However, we should give application teams more support than just a blank canvas: we should be looking to share successful architecture patterns and example applications that have already been approved for use within your organization.

Rather than documents, start to foster an innersource repo of samples that can be simply provisioned into the vended environment, showing what a static webapp, a simple microservices app, or an event-driven process could look like. This gives new teams a starting point with built-in approved patterns to accelerate their journey to production. These examples, with good READMEs, can also show teams how to structure their own application repos, with infrastructure-as-code and automated deployment workflows.

Look at the Azure Developer CLI templates as a good example of this. You don't need to use the tool, but azd template list shows a list of sample application patterns with well-documented, structured repos. You can start to create a curated list of getting-started repos for each of the application solution categories in your organisation, even starting with some of these samples where relevant.
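For example, scaffolding and deploying one of those templates into a vended environment takes only a couple of commands (todo-nodejs-mongo is just one template from the public gallery):

```shell
# Sketch: browse the public template gallery, scaffold a sample, deploy it.
azd template list
azd init --template todo-nodejs-mongo
azd up   # provisions the infra and deploys the app into the environment
```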


Infrastructure-as-code modules

Another thing to notice, and adopt, from these sample repos' /infra folders: the main Bicep file is just composing a number of modules, and those modules represent the 'right' way of configuring each service for your organization, for example pre-configured with private endpoints, RBAC-based access, and so on. You can build a repo of modules approved for use in your organization to, again, accelerate your application teams. You can get started with Azure Verified Modules, or build your own Bicep module library, innersourcing it and sharing it between the application teams.
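To illustrate, once main.bicep composes the approved modules, a single command deploys the whole composition into the vended resource group; the names below are placeholders:

```shell
# Sketch: main.bicep composes the organization's approved modules; one
# command deploys the composition. Resource group and paths are placeholders.
az deployment group create \
  --resource-group "rg-team-a-dev" \
  --template-file infra/main.bicep \
  --parameters environment=dev
```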

Kubernetes namespace vending

note

This article doesn't cover namespace vending on large, shared AKS clusters. I've seen this pattern work very well in organizations with a Kubernetes-first approach to the public cloud, or with a strong will for cloud-agnostic solutions; however, it requires additional engineering, using AKS plus other tooling, beyond the scope of this article. There is a great session from Build 2024 on building this service.

You can provide example solution patterns and repos for scale-to-zero microservices using Azure Container Apps, or, for more complex needs, AKS Automatic. These are great additions to your library of solution patterns that application teams can provision themselves, keeping complexity low while providing example patterns for even the most sophisticated applications.

4. Tooling / Local loop development

Application teams experiment in the portal, develop locally, and provision from their local machine; then they add the automation and the managed identity to perform automatic deployments via source control in the later environments.

Ensure the teams can install and configure VS Code, VS Code extensions, command-line tools, and Docker locally, and that they have connectivity from these tools to the public cloud APIs they need.

The @azure/identity libraries are now brilliant! For many dependencies there is no longer any need for API keys or credentials that have to be stored in a key vault and rotated periodically; instead, just use your Entra ID corporate identity or a Managed Identity with RBAC. With these identity libraries, if a developer wants to run their code locally and connect to a database or message service in Azure, the locally running app operates with the developer's corporate identity (obtained through az login); as long as the dev has the appropriate RBAC on the database, all good. If they deploy the app to an Azure PaaS service, without any code changes, the code accesses the database using the service's managed identity. This makes apps secure and resilient, and they can be promoted to production securely.
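In practice, the local inner loop is just this (assuming the app builds its Azure clients with DefaultAzureCredential from @azure/identity):

```shell
# Sketch of the local inner loop with identity-based access; no keys involved.
az login    # DefaultAzureCredential will pick up the Azure CLI credential
npm start   # the app now calls Azure data services as the signed-in developer
# Deployed to an Azure PaaS service, the same code picks up the managed identity.
```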

Without these tools and this access, the application teams will not be coding their app in the most secure way.

5. Track Metrics

Track anything that causes friction. Any time the application team is waiting on something (a case, access to a service, a bug being resolved), track it, dashboard it, and constantly prioritize securely removing that friction. Promote the creation of issues on the platform team's repo, keep a prioritized backlog, and hold monthly feedback sessions.

If application teams are held up, they will try to work around issues to ship their product; this can mean using the wrong environment, or a less-than-ideal service or configuration. Removing friction therefore results in better, more secure use of the public cloud.

Wrapup

Let me know what you think of these recommendations. If you are in a Platform Team supporting Azure, I'd love to hear your experiences. If you are in an Application Team deploying to Azure, have a chat with the team providing you the environment, show them this blog, and set up a regular call; it's important the teams collaborate to get your company's products out the door securely, reliably and on time.

Everything's changed. Will I use React again?

· 5 min read
Keith Howling
Apps in the cloud

Starting a new Project in 2024

Vite

Time to build a ChatGPT app, to learn how to develop solutions with a new, unified user experience. This will be a feed-based UI with a single-input form fixed at the bottom, and a scrolling feed rendering results from both ChatGPT and Mongo. Sounds like a SPA.

So, let's get to it, starting with my trusty tech stack: React, Node/TypeScript, Mongo. In recent times I've added tRPC and zod.

tip

This makes server/browser development as simple as possible: same language, same types. In fact, I'm a big advocate of "complexity kills", even in the enterprise, so I try not to abstract, inject, or complicate unless necessary. I've stayed clear of GraphQL, and that's been a good call so far.

Everything has Changed

Looking into the recent releases of React to iterate my stack with the latest updates, I see big changes. The last big update was hooks, and I loved them, so let's dive in!

tip

I'm a big fan of the OSS team at Facebook; I see so much great tech coming from them. I started using client frameworks with Angular 1.x back in the day, as I thought I could rely on Google for webdev guidance, but, oh my god, what a mess: I spent 80% of my time understanding the framework and 20% building my app, and then came the Angular 2 upgrades, ugh. I haven't used any Google OSS projects since (aside from Kubernetes, now in the CNCF), but I hear the latest Angular is much improved.

create-react-app is no longer recommended, and it looks like everyone is using Vite for SPAs. Also, the biggest change of all: React has become a server-side framework!

Vite

Ok, Vite it is! It actually turned out to be a great drop-in replacement for create-react-app, I don’t need to change my workflow, and everything is faster. So let's go!

Two days later, I realise this is no longer a great choice for my app. Although it's a SPA, I have hit two big issues:

  • The input needs to be POSTed to the server, and the resulting server message appended to the scrolling feed. I guess I'll useState with an array of messages for the feed items; this array only ever gets appended to, and it feels like rendering will slow down as it gets bigger and bigger. So do I useEffect for my main UI component, essentially bypassing React and making it a leaky abstraction?
  • tRPC doesn't appear to handle chunked HTTP responses from ChatGPT, so that abstraction is going to leak too, and I'll need to build code around it.

OK, so as we are going to be making a lot of calls to the server for each render, this is the time for React Server Components. Yes, let's go, I'm excited!

NextJs

OK, more reading later: to use React Server Components (which look awesome), I now need a meta-framework? I cannot just use React anymore. OK, Next.js looks like the frontrunner.

warning

I was a little apprehensive about this. Next belongs to Vercel, which looks to be an opinionated cloud hosting provider. I want to deploy to a non-opinionated service on a hyperscale cloud, so I'm concerned the Next roadmap will focus on getting it to run really well on Vercel, or its edge. That is not my use-case.

Anyway, let's get stuck in! Two days later, I realise this is no longer a great choice for my app. I thought I could append Server Components into the feed, allowing me to use await on the server to query Mongo or ChatGPT and return HTML via JSX to append into the feed, but I hit two big issues:

  • "You cannot import a Server Component into a Client Component" and "Server Components don't have lifecycle events (hooks/effects/refs etc.)". This is massive. No matter how I tried to architect my app, I hit a brick wall: I couldn't append Server Components to my feed UI Client Component, and I couldn't turn the feed into a Server Component because it needed state.
  • I didn't really want or need a filesystem-based router.

So I made everything a Client Component, using the API Routes for fetching the server data. It worked, but what value was Next.js adding now? I wasn't happy; this was a shambles!

Two days later, I saw a YouTube video on </> htmx. The DOM-swapping beforeend directive jumped out at me; this looked like all I needed! But could I build a webapp without React components and JSX?

Turns out I could! And two days in, it works and it's simple, even using SSE for the chunked ChatGPT streaming responses!

I'm sticking with this; let's see how all the other features go, but I have high hopes....

So, whats next?

My end-to-end tech stack has completely revolved around JavaScript/TypeScript for years; I know it very well and I'm fast at getting prototypes out the door. I need types, as they remove so many bugs and typos, but I do have a lot of pain with TypeScript, always seeming to be fighting module/import issues.

But now, if I no longer need JavaScript for client logic, do I stick with it on the server? Do I build my server in Go, maybe even .NET 8? What is the best templating engine?

My world is turning upside down….

Who am I

· One min read

Love writing prototypes, innovating with software and the cloud, using the best the open source world has to offer to deliver fast innovation.