Rearrange name poisoning logic to do a little less work. #4766

zygoloid · 2025-01-07T20:10:54Z

Insert the poison at the same time we do the name lookup to avoid doing
two hash table lookups into each scope. This adds a bit of complication
because import logic now needs to cope with importing a name that is
already poisoned, but the complexity seems worthwhile to reduce the
number of name lookups performed.

This incidentally fixes a bug where we wouldn't poison any name scopes
if we found the name in an enclosing lexical scope, leading to one extra
diagnostic in existing tests.

Part of #4622

Insert the poison at the same time we do the name lookup to avoid doing two hash table lookups into each scope. This adds a bit of complication because import logic now needs to cope with importing a name that is already poisoned, but the complexity seems worthwhile to reduce the number of name lookups performed. This incidentally fixes a bug where we wouldn't poison any name scopes if we found the name in an enclosing lexical scope, leading to one extra diagnostic in existing tests.

bricknerb

Thanks for doing this!
It's definitely more optimized.
I do not fully understand the behavior change though.
What I thought we were trying to do before is when lookup was successful, poison the name in all the scopes between where the name was found and where we started looking.
From the test change, it seems that now we poison a name even when the lookup is not successful.
Is this what we want?
See #4774

bricknerb · 2025-01-08T09:54:47Z

toolchain/sem_ir/name_scope_test.cpp

  EXPECT_THAT(name_scope.entries(),
              ElementsAre(NameScopeEntryEquals(
                  NameScope::Entry({.name_id = poison1,
                                    .inst_id = InstId::PoisonedName,
                                    .access_kind = AccessKind::Public}))));

  NameId poison2(++id);
-  name_scope.AddPoison(poison2);
+  EXPECT_EQ(name_scope.LookupOrPoison(poison2), std::nullopt);


I think we should add coverage for cases where LookupOrPoison() doesn't return std::nullopt.

bricknerb · 2025-01-08T10:13:08Z

toolchain/sem_ir/name_scope.cpp

-  CARBON_CHECK(result.is_inserted(), "Failed to add required name: {0}",
-               name_entry.name_id);
+  if (!result.is_inserted()) {
+    // A required name can overwrite poison.


Is this tested in the unit tests?

jonmeow · 2025-01-08T17:12:18Z

Can you add some context for why #4622 is mentioned in the PR description?

jonmeow

Are you thinking about something like SFINAE for templates to get name poisoning to be an error this way?

jonmeow · 2025-01-08T17:21:57Z

toolchain/check/context.h

+  // If `for_decl_name` is false, then this is a regular name lookup, and the
+  // name will be poisoned if not found so that later lookups will fail; a
+  // poisoned name will be treated as if it is not declared. Otherwise, this is
+  // a lookup for a name being declared, so the name will not be poisoned, but


"for_decl_name" making me stumble a little, partly with "is" versus "for". Noting the way you describe it here and how it's called, had you considered something like "is_in_decl" to match "LookupNameInDecl", or "is_declaring", something like that?

jonmeow · 2025-01-08T17:29:25Z

toolchain/sem_ir/name_scope.h

+  // Adds a new name known to not exist. The new entry may not be poisoned. This
+  // is allowed even if the name has already been poisoned.


If the name was already poisoned, is the new entry still poisoned? That's particularly important to capture since we're looking at storing instructions for poisoned names, so "has instruction and is poisoned" is becoming a valid state. Maybe:

Suggested change

// Adds a new name known to not exist. The new entry may not be poisoned. This

// is allowed even if the name has already been poisoned.

// Adds a new name known to not exist. The new entry won't be poisoned, and

// can overwrite poisoned names.

jonmeow · 2025-01-08T17:34:11Z

toolchain/sem_ir/name_scope.h

+  // Searches for the given name and returns an EntryId if found or nullopt if
+  // not. If the name is not found, it will be poisoned so it can't be declared
+  // later.


Would it be worth phrasing this closer to how LookupOrAdd's comment is phrased? e.g.:

Suggested change

// Searches for the given name and returns an EntryId if found or nullopt if

// not. If the name is not found, it will be poisoned so it can't be declared

// later.

// If the given name already exists, return the EntryId; the entry might be

// poisoned. Otherwise, poisons the name so that it can't be declared later

// and returns nullopt.

(note, either way, I think it'd be good to have "if not found" behavior grouped instead of "or nullopt if not. If the name is not found,")

zygoloid · 2025-01-08T18:52:43Z

What I thought we were trying to do before is when lookup was successful, poison the name in all the scopes between where the name was found and where we started looking. From the test change, it seems that now we poison a name even when the lookup is not successful. Is this what we want?

The main behavior change is that we now poison the name in the case where lookup is successful and finds a lexical result. Previously we only poisoned the name when lookup was successful and found a non-lexical result. Ultimately, the idea is that any time we look for a name in a declarative scope, other than when declaring it, it's an error if we don't find it and we later add it. So poisoning the name as part of the lookup seems like the right model.

In the case of a failed lookup followed by a declaration, I do agree that the best thing would be to produce only a single error. Diagnosing both the failed lookup and the declaration of a poisoned name doesn't seem terrible to me -- it provides all the information that we have, which I think is probably a little more useful than diagnosing only the use of the undeclared identifier. Ideally I think I'd want us to produce a single "use of name before its declaration" error for the use that also points at where the name is later declared. That said, we can't really do that unless we produce diagnostics out of order (or modify the diagnostic after we initially emit it), and I'm nervous about building dependencies on diagnostic reordering given #3054, so we might need to think a bit about how to fit that into our diagnostic infrastructure.

bricknerb · 2025-01-09T08:05:59Z

Can you add some context for why #4622 is mentioned in the PR description?

I've added it as this change seems to be part of name poisoning feature.

bricknerb · 2025-01-09T11:07:35Z

What I thought we were trying to do before is when lookup was successful, poison the name in all the scopes between where the name was found and where we started looking. From the test change, it seems that now we poison a name even when the lookup is not successful. Is this what we want?

The main behavior change is that we now poison the name in the case where lookup is successful and finds a lexical result. Previously we only poisoned the name when lookup was successful and found a non-lexical result. Ultimately, the idea is that any time we look for a name in a declarative scope, other than when declaring it, it's an error if we don't find it and we later add it. So poisoning the name as part of the lookup seems like the right model.

In the case of a failed lookup followed by a declaration, I do agree that the best thing would be to produce only a single error. Diagnosing both the failed lookup and the declaration of a poisoned name doesn't seem terrible to me -- it provides all the information that we have, which I think is probably a little more useful than diagnosing only the use of the undeclared identifier. Ideally I think I'd want us to produce a single "use of name before its declaration" error for the use that also points at where the name is later declared. That said, we can't really do that unless we produce diagnostics out of order (or modify the diagnostic after we initially emit it), and I'm nervous about building dependencies on diagnostic reordering given #3054, so we might need to think a bit about how to fit that into our diagnostic infrastructure.

Thanks for clarifying!

Regarding the behavior change, I'm not sure I understand what is a lexical result vs. a non-lexical result here.
I don't think we have a test that covers that (the modified test in this PR is for a failed lookup, AFAIU), so adding a test would probably clarify what is changing and makes sure it stays that way.

zygoloid added 2 commits December 20, 2024 00:08

Merge branch 'trunk' into toolchain-simplify-lookup

322e94b

zygoloid requested a review from bricknerb January 7, 2025 20:10

github-actions bot requested a review from jonmeow January 7, 2025 20:11

github-actions bot added the toolchain label Jan 7, 2025

bricknerb reviewed Jan 8, 2025

View reviewed changes

jonmeow mentioned this pull request Jan 8, 2025

Add a test that shows names are not poisoned when lookup fails #4774

Merged

jonmeow reviewed Jan 8, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rearrange name poisoning logic to do a little less work. #4766

Rearrange name poisoning logic to do a little less work. #4766

zygoloid commented Jan 7, 2025 •

edited by bricknerb

Loading

bricknerb left a comment

bricknerb Jan 8, 2025

bricknerb Jan 8, 2025

jonmeow commented Jan 8, 2025

jonmeow left a comment

jonmeow Jan 8, 2025

jonmeow Jan 8, 2025

jonmeow Jan 8, 2025 •

edited

Loading

jonmeow Jan 8, 2025 •

edited

Loading

zygoloid commented Jan 8, 2025

bricknerb commented Jan 9, 2025

bricknerb commented Jan 9, 2025

		// Adds a new name known to not exist. The new entry may not be poisoned. This
		// is allowed even if the name has already been poisoned.

Rearrange name poisoning logic to do a little less work. #4766

Are you sure you want to change the base?

Rearrange name poisoning logic to do a little less work. #4766

Conversation

zygoloid commented Jan 7, 2025 • edited by bricknerb Loading

bricknerb left a comment

Choose a reason for hiding this comment

bricknerb Jan 8, 2025

Choose a reason for hiding this comment

bricknerb Jan 8, 2025

Choose a reason for hiding this comment

jonmeow commented Jan 8, 2025

jonmeow left a comment

Choose a reason for hiding this comment

jonmeow Jan 8, 2025

Choose a reason for hiding this comment

jonmeow Jan 8, 2025

Choose a reason for hiding this comment

jonmeow Jan 8, 2025 • edited Loading

Choose a reason for hiding this comment

jonmeow Jan 8, 2025 • edited Loading

Choose a reason for hiding this comment

zygoloid commented Jan 8, 2025

bricknerb commented Jan 9, 2025

bricknerb commented Jan 9, 2025

zygoloid commented Jan 7, 2025 •

edited by bricknerb

Loading

jonmeow Jan 8, 2025 •

edited

Loading

jonmeow Jan 8, 2025 •

edited

Loading