Simplify or clarify how to refactor an CDK codebase #4733

abelmokadem · 2019-10-29T06:49:05Z

This is just to start a discussion, but currently I'm finding it really hard to refactor an AWS CDK codebase because of the CloudFormation resource names. Is it possible to create a separation between the names and the structure of the code? I understand that this is at the core of aws-cdk and it allows us to create reusable constructs, but this also makes refactoring nearly impossible.

eladb · 2019-10-29T07:13:15Z

Can you elaborate on why it makes refactoring impossible?

abelmokadem · 2019-10-29T07:24:49Z

I think I need to clarify what I mean by refactoring. In this case I'm just talking about moving your resources into and out of constructs or breaking up a construct into smaller constructs. The end result of this refactoring process should be that cdk diff doesn't show any difference. That would be my ideal situation.

In the current situation, moved resources get new logical IDs, so the existing ones are removed and new ones are created. Think of databases.

orangewise · 2019-10-29T07:28:46Z

I agree, @abelmokadem has a valid point. Would be great if this was easier.

eladb · 2019-10-29T07:31:28Z

Thanks for the clarification. Unfortunately, we don't have a good solution to this problem that would not require you to explicitly specify absolute logical IDs for all the resources you define, and, as you observed, that would dramatically hinder composability and reusability.

But we can talk about a few things:

You can always simply set logicalId in the defaultChild if you want to preserve a logical ID after a refactor. Obviously that's not ideal, but it's an "escape hatch" that is available to you in case, for example, you don't want your DynamoDB table to be removed.
We are looking at adding a more formal definition of "stateful resources" (#2282). This will give us a bit more understanding as to which resources in your stack are sensitive to replacement, and we could, at least, protect you from disastrous renames.

This is something we discussed for over a year as we designed the CDK, and it was possibly the biggest challenge we had in designing the programming model. I can't say we have the perfect solution, but we could not come up with a better one.

If anyone has ideas, we would love to hear from you, but we can't compromise composability. It's a key design tenet that makes CDK possible.

EDIT: added reference to #2282

abelmokadem · 2019-10-29T08:07:47Z

I have somewhat of an idea. Please let me know if this absolutely can't work.

I also managed to get myself confused somewhere down the road, but I still think it could work. It could be a big change for aws-cdk, but for end users it could just be a matter of creating a stack with an enableRefactor option set to true.

In short:

Use UUID as logicalId for all resources in the stack
Introduce cdk refactor --mapping <mapping_file> command to "refactor" your stack
Create a mapping file where you map the generated logicalId to the new cdk path

When running cdk deploy:

for resource in stack:
  // Update existing resource
  logicalId = find uuid for resource by current cdkPath

  if (!logicalId):
    // Create new resource
    logicalId = generate uuid
  
  addMetadata(cdkPath)

When running cdk refactor --mapping mapping.yaml:

for resource in stack:
  // "Refactor" existing resource to new cdk path
  logicalId = mappings[cdkPath]
  
  if (!logicalId):
    // Update existing resource
    logicalId = find uuid for resource by current cdkPath

  if (!logicalId):
    // Create new resource
    logicalId = generate uuid
  
  addMetadata(cdkPath)
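
The two loops above share one lookup order: mapping file first, then the deployed stack, then a fresh ID. A minimal TypeScript sketch of that order follows. It is only a model of the proposal: `resolveLogicalIds`, `IdIndex`, and the injected `newId` generator are hypothetical names, nothing here is part of aws-cdk, and the `addMetadata(cdkPath)` step is left out.

```typescript
// Hypothetical model of the proposal above; none of these names exist in aws-cdk.
type IdIndex = { [cdkPath: string]: string | undefined };

interface ResolveInput {
  // CDK paths of the resources in the freshly synthesized stack.
  paths: string[];
  // "aws:cdk:path" -> logical ID, as read from the currently deployed template.
  deployed: IdIndex;
  // Optional refactor mapping: new CDK path -> existing logical ID.
  // Leaving it out corresponds to a plain `cdk deploy` in the proposal.
  mapping?: IdIndex;
  // ID generator, injected so the sketch stays deterministic.
  newId: () => string;
}

function resolveLogicalIds(input: ResolveInput): { [cdkPath: string]: string } {
  const result: { [cdkPath: string]: string } = {};
  for (const path of input.paths) {
    // 1. "Refactor": the path was moved, so reuse the mapped ID.
    const mapped = input.mapping ? input.mapping[path] : undefined;
    // 2. Update: the path is unchanged since the last deploy.
    // 3. Create: the resource has never been deployed.
    result[path] = mapped ?? input.deployed[path] ?? input.newId();
  }
  return result;
}

// Usage: the table moved from "App/Table" to "App/Db/Table"; the bucket is new.
let counter = 0;
const ids = resolveLogicalIds({
  paths: ["App/Db/Table", "App/Queue", "App/Bucket"],
  deployed: { "App/Table": "uuid-1", "App/Queue": "uuid-2" },
  mapping: { "App/Db/Table": "uuid-1" },
  newId: () => `uuid-new-${++counter}`,
});
```

In this sketch the moved table keeps `uuid-1`, the untouched queue keeps `uuid-2`, and only the bucket gets a generated ID.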

eladb · 2019-10-29T08:34:38Z

> Use UUID as logicalId for all resources in the stack

When will the UUID be generated? It has to be stable across synths, so we can't generate it at runtime.

abelmokadem · 2019-10-29T08:46:30Z

@eladb it is generated only when cdk can't find the CDK path in the CloudFormation template of the currently deployed stack.

cdk deploy

if any resource in deployed stack contains cdk path {
  use uuid of that resource from the deployed stack
} else {
  generate uuid
}

eladb · 2019-10-29T10:00:24Z

> use uuid of that resource from the deployed stack

How do we know the uuid of a resource from the deployed stack? We need a stable address/identity in order to correlate between the resource in your code and the deployed resource.

abelmokadem · 2019-10-29T10:04:40Z

You find the uuid by looking for the cdk metadata field. You can use this field to correlate the resource in your code with the resource in the deployed stack.

      "Metadata": {
        "aws:cdk:path": ".../Dns/StackHostedZone/Resource"
      }

If this path does not exist in either the mapping file or the deployed stack, it means you are creating a new resource.
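
The lookup described here can be sketched as an index from the `aws:cdk:path` metadata value to the logical ID, built from the deployed template. This is a rough sketch that assumes the template still carries path metadata. The `Template` types, the `indexByCdkPath` helper, and the sample path and IDs are made up for illustration.

```typescript
// Minimal shape of a CloudFormation template; only the fields used here.
interface TemplateResource {
  Type: string;
  Metadata?: { [key: string]: unknown };
}
interface Template {
  Resources: { [logicalId: string]: TemplateResource };
}

// Build an index from "aws:cdk:path" to logical ID.
// Resources without path metadata are skipped.
function indexByCdkPath(template: Template): { [cdkPath: string]: string } {
  const index: { [cdkPath: string]: string } = {};
  for (const logicalId of Object.keys(template.Resources)) {
    const resource = template.Resources[logicalId];
    const path = resource?.Metadata?.["aws:cdk:path"];
    if (typeof path === "string") {
      index[path] = logicalId;
    }
  }
  return index;
}

// Usage with a made-up deployed template.
const deployedTemplate: Template = {
  Resources: {
    "3f0c5a1e": {
      Type: "AWS::Route53::HostedZone",
      Metadata: { "aws:cdk:path": "DemoStack/Dns/HostedZone/Resource" },
    },
    TopicWithoutMetadata: { Type: "AWS::SNS::Topic" },
  },
};
const byPath = indexByCdkPath(deployedTemplate);
```

A path that is missing from `byPath` (and from the mapping file) would then be treated as a new resource.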

eladb · 2019-10-29T10:11:30Z

> "aws:cdk:path": ".../Dns/StackHostedZone/Resource"

But the problem is that if you refactor your code, this path will change and you "lose" your connection to the actual resource.

abelmokadem · 2019-10-29T10:19:56Z

That is what the mapping file is for. If you treat refactor as a separate action, then you can create a cdk refactor --mapping mapping.json command.

mapping.json

{
  "new cdk path": "existing uuid"
}

This action will simply update the cdk path metadata in CloudFormation. That is all it should do if you moved a resource but made no other changes to the stack.

abelmokadem · 2019-10-29T10:32:12Z

You could even consider making it a part of cdk deploy to make things stateless. I haven't thought about how to do that. You would have to add additional metadata to keep track of which "refactor" was executed and which was not.

moofish32 · 2019-10-29T13:23:38Z

As another CDK user/contributor, I think this mapping file really pushes against the grain of CDK and CloudFormation (CFN). In my mind CFN owns state management, and this mapping file starts to feel like an artifact I might need to preserve. For example, if you make a refactor and need to promote it from dev to perf to prod, who validates the mapping file at each step, and how long do I need to keep it? In VCS or out?

Counter proposal

If I take a little inspiration from state file management in other systems (see below for the TLDR), I like the direction of a stateful boolean. In my head, instead of stateful, I think about this as a "root" namespace where the rules for logical IDs are changed. In this namespace the logical ID is derived from the name/ID provided in the constructor and a hash of the properties that cause destroy/recreate (the CDK path is no longer part of the logical ID here, and we have to manage name length). The user/consumer now owns the responsibility of ensuring unique names in the "root" namespace, and if you change a property that causes a destroy/recreate, the logical ID will clearly indicate that and diff will detect it.
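
To make the "name plus hash of replacement-causing properties" rule concrete, here is a small TypeScript sketch. It is not how aws-cdk allocates IDs. `rootLogicalId`, `RootResource`, and the choice of which properties count as replacement-causing are assumptions for illustration, and a dependency-free FNV-1a style hash stands in for whatever hash a real implementation would use. Name length handling is left out.

```typescript
// Hypothetical sketch of the "root namespace" rule; not part of aws-cdk.

// FNV-1a style 32-bit hash over UTF-16 code units, written with shifts so it
// needs no libraries. Returns 8 uppercase hex digits.
function fnv1a(text: string): string {
  let hash = 0x811c9dc5;
  for (let i = 0; i < text.length; i++) {
    hash ^= text.charCodeAt(i);
    // Multiply by the FNV prime 0x01000193, modulo 2^32.
    hash = (hash + (hash << 1) + (hash << 4) + (hash << 7) + (hash << 8) + (hash << 24)) >>> 0;
  }
  return ("00000000" + hash.toString(16).toUpperCase()).slice(-8);
}

interface RootResource {
  name: string; // user-chosen, must be unique within the "root" namespace
  props: { [key: string]: string | number | boolean };
  replacementProps: string[]; // properties whose change forces destroy/recreate
}

function rootLogicalId(resource: RootResource): string {
  // Sort the keys so the hash does not depend on declaration order.
  const relevant = resource.replacementProps
    .slice()
    .sort()
    .map((key) => `${key}=${String(resource.props[key])}`)
    .join(";");
  return `${resource.name}${fnv1a(relevant)}`;
}

// Usage: only tableName is treated as replacement-causing in this example.
const before = rootLogicalId({
  name: "Orders",
  props: { tableName: "orders", billingMode: "PROVISIONED" },
  replacementProps: ["tableName"],
});
const billingChanged = rootLogicalId({
  name: "Orders",
  props: { tableName: "orders", billingMode: "PAY_PER_REQUEST" },
  replacementProps: ["tableName"],
});
const renamed = rootLogicalId({
  name: "Orders",
  props: { tableName: "orders-v2", billingMode: "PROVISIONED" },
  replacementProps: ["tableName"],
});
```

Here `before` and `billingChanged` are the same ID, because the changed property does not force a replacement, while `renamed` differs, so a diff would show the destroy/recreate.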

state file management in other systems TLDR

If we look across the landscape, this is why Terraform maintains state externally, and it's also why, if I corrupt my state file, it can be very hard and potentially impossible to restore. However, because the user accepts that responsibility, we now have the ability to manually manipulate the state file and move resources between modules.

eladb · 2019-10-29T13:37:31Z

@abelmokadem are you aware that we already have support for renaming logical IDs globally, exactly for the refactor use case? It's almost exactly like your mapping file, but in code (and of course, if you want, you can load a rename map from a file).

eladb · 2019-10-31T06:37:38Z

For posterity, people can also completely override how resource logical IDs are allocated by subclassing Stack and overriding allocateLogicalId.
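
The shape of such an override can be sketched with stand-in classes. The classes below only mimic the pattern and are not the aws-cdk types: `ModelStack`, `ModelElement`, and `PinnedIdsStack` are hypothetical, and the default scheme here (path components joined together) is a simplification of what the CDK actually generates. The path and ID are the ones quoted later in this thread.

```typescript
// Stand-in types that mimic the override pattern; NOT the aws-cdk classes.
interface ModelElement {
  path: string;
}

class ModelStack {
  // Simplified default: concatenate the path components.
  protected allocateLogicalId(element: ModelElement): string {
    return element.path.split("/").join("");
  }

  public logicalIdOf(element: ModelElement): string {
    return this.allocateLogicalId(element);
  }
}

class PinnedIdsStack extends ModelStack {
  private readonly pinned: { [path: string]: string | undefined };

  constructor(pinned: { [path: string]: string | undefined }) {
    super();
    this.pinned = pinned;
  }

  protected allocateLogicalId(element: ModelElement): string {
    // Keep the historical ID for known paths; otherwise use the default scheme.
    return this.pinned[element.path] ?? super.allocateLogicalId(element);
  }
}

// Usage: pin the refactored table to its historical logical ID.
const modelStack = new PinnedIdsStack({
  "App/DDB/TableEverything/Resource": "TableEverything9827B373",
});
const pinnedId = modelStack.logicalIdOf({ path: "App/DDB/TableEverything/Resource" });
const defaultId = modelStack.logicalIdOf({ path: "App/Queue/Resource" });
```

Only the pinned path deviates from the default scheme, so the override can shrink as old resources are retired.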

I am closing this for now. Please reopen if you feel you don't have sufficient tools to be able to control your logical names in a refactor.

foriequal0 · 2020-11-13T07:39:57Z

After some refactoring, I have so many stack.renameLogicalId() calls in my codebases, and I can't remove them without losing the resources. I wish there were some kind of renaming deploy.

Here's a draft of the idea:

Mark some resource elements with an API something like this

const subnet = new PrivateSubnet(this, "Private", { ... });
rename({ element: subnet.node.findChild("NATGateway"), oldId: "FooBarVpcSubnetNATGatewayABCD1234" });

Check that all the marked resources can be imported into the existing stack.
https://docs.aws.amazon.com/AWSCloudFormation/latest/UserGuide/resource-import-supported-resources.html
Prepare the following ChangeSets.
1. PREPARE: All the resources in the template keep their old IDs, with "DeletionPolicy": "Retain".
2. REMOVE: Remove all marked resources from the PREPARE template. All references to them are replaced with their physical IDs.
3. REIMPORT: Every old ID in the PREPARE template is changed to the new ID. The ChangeSet's ResourcesToImport has the new ID and physical ID pairs.
4. REIMPORT_CLEANUP: Detach "DeletionPolicy": "Retain" from the REIMPORT template.
5. REMOVE_REVERT: The template is the same as PREPARE. The ChangeSet's ResourcesToImport has the old ID and physical ID pairs.
6. REMOVE_REVERT_CLEANUP: Detach "DeletionPolicy": "Retain" from the REMOVE_REVERT template.
Execute the ChangeSets in this order.

PREPARE -> REMOVE -> REIMPORT --Okay--> REIMPORT_CLEANUP
                            \--Error--> REMOVE_REVERT -> REMOVE_REVERT_CLEANUP

Remove rename(...) calls in the code.
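
The execution order in the diagram above can be sketched as a small driver. This is a sketch under assumptions: `runRenameDeploy` and the `execute` callback are hypothetical, `execute` stands in for "create and execute this ChangeSet" and reports success as a boolean, and stopping early when PREPARE, REMOVE, or REMOVE_REVERT fails is a choice the draft does not spell out.

```typescript
// Hypothetical driver for the six ChangeSets named above.
type Step =
  | "PREPARE"
  | "REMOVE"
  | "REIMPORT"
  | "REIMPORT_CLEANUP"
  | "REMOVE_REVERT"
  | "REMOVE_REVERT_CLEANUP";

// Runs the steps in the drafted order and returns the steps that were attempted.
function runRenameDeploy(execute: (step: Step) => boolean): Step[] {
  const attempted: Step[] = [];
  const run = (step: Step): boolean => {
    attempted.push(step);
    return execute(step);
  };

  // Assumption: if PREPARE or REMOVE fails, stop without going further.
  if (!run("PREPARE") || !run("REMOVE")) {
    return attempted;
  }

  if (run("REIMPORT")) {
    // Import under the new IDs worked: drop the temporary Retain policies.
    run("REIMPORT_CLEANUP");
  } else if (run("REMOVE_REVERT")) {
    // Import failed: the resources were re-imported under their old IDs.
    run("REMOVE_REVERT_CLEANUP");
  }
  return attempted;
}

// Usage: one run where everything succeeds, one where REIMPORT fails.
const happyPath = runRenameDeploy(() => true);
const failedImport = runRenameDeploy((step) => step !== "REIMPORT");
```

A real implementation would be asynchronous and would wait for each ChangeSet to finish; the synchronous boolean keeps the control flow easy to read.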

abelmokadem · 2021-03-19T09:04:47Z

aws/aws-cdk-rfcs#162

brendonparker · 2021-07-09T14:06:57Z

@eladb I know this is an old issue, but do you have an example we can reference for how to combine allocateLogicalId with renameLogicalId?

Much appreciated.

EDIT
Something like this? (in C#):

protected override string AllocateLogicalId(CfnElement cfnElement)
{
    if (cfnElement.ToString() == "App/DDB/TableEverything/Resource [AWS::DynamoDB::Table]")
    {
        RenameLogicalId(cfnElement.LogicalId, "TableEverything9827B373");
    }
    return base.AllocateLogicalId(cfnElement);
}

In my case, I did some refactoring which moved the DDB table into its own construct.
IDK if there is a "better" way to detect which CfnElement I'm interested in.

brendonparker · 2021-07-09T14:44:49Z

Hmm, I couldn't get that to work.

I was attempting to move a table from the stack construct into its own construct.
In the end, I ran cdk synth after the construct change/refactor and compared the logical ID in the output with my past output.

Then added:

RenameLogicalId("DDBTableEverythingB1F30BC5", "TableEverything9827B373");

Where the first param is the logical ID generated after the most recent change, and the second param is the historical logical ID.
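
The argument order is easy to get backwards, so here is a toy TypeScript model of a rename table. `RenameTable` is a stand-in, not the CDK's internal implementation, and rejecting a second rename of the same ID is this model's own choice. The two IDs are the ones from the call above.

```typescript
// Toy model of a rename table; NOT the aws-cdk implementation.
class RenameTable {
  private readonly renames: { [generatedId: string]: string | undefined } = {};

  // First argument: the ID the code generates now. Second: the ID to emit instead.
  rename(generatedId: string, historicalId: string): void {
    if (this.renames[generatedId] !== undefined) {
      throw new Error(`${generatedId} already has a rename registered`);
    }
    this.renames[generatedId] = historicalId;
  }

  // Applied to each generated ID before it is written to the template.
  resolve(generatedId: string): string {
    return this.renames[generatedId] ?? generatedId;
  }
}

// Usage with the IDs from this comment.
const renameTable = new RenameTable();
renameTable.rename("DDBTableEverythingB1F30BC5", "TableEverything9827B373");
const emittedId = renameTable.resolve("DDBTableEverythingB1F30BC5");
const untouchedId = renameTable.resolve("SomeOtherResource1A2B3C4D");
```

In this model the refactored table is emitted under its historical ID, so the deployed resource stays in place, and every other ID passes through unchanged.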

abelmokadem added the needs-triage This issue or PR still needs to be triaged. label Oct 29, 2019

abelmokadem changed the title ~~Separate constructs from cloudformation names~~ Simplify or clarify how to refactor an CDK codebase Oct 29, 2019

SomayaB assigned eladb Oct 29, 2019

SomayaB added @aws-cdk/core Related to core CDK functionality feature-request A feature should be added or improved. and removed needs-triage This issue or PR still needs to be triaged. labels Oct 29, 2019

eladb closed this as completed Oct 31, 2019

gliptak mentioned this issue May 28, 2021

updating aws_cdk.core.Stack id leaves previous CF stack deployed/behind #14911

Closed

malcyL mentioned this issue Jul 15, 2022

feat: ele-4721 add support for security headers for cdn site hosting techfromsage/talis-cdk-constructs#60

Closed

epheat mentioned this issue Dec 19, 2022

CDK Refactoring Tools aws/aws-cdk-rfcs#162

Open
