
better flesh out operator privileges in other Silos #1681

Closed · Tracked by #849
davepacheco opened this issue Sep 7, 2022 · 6 comments

@davepacheco
Collaborator

#1340 proposed that users with fleet-level privileges would have no privileges to access siloed resources in Silos other than their own. (This phrasing wasn't really fleshed out until RFD 297, but I believe that's essentially what #1340 meant.) The principle was essentially: we shouldn't be deciding who's allowed to cross Silo lines -- if an operator wants to access another Silo, they can do that, but they do it by having an account in that Silo's IdP, which makes it noisy and auditable.

RFD 309 raises a number of user stories that cast some doubt on this approach. To be clear, I'm not sure yet what the right answer is, but there are enough questions that I think we want to revisit this before committing too far one way or the other.

CC @plotnick @rmustacc @kc8apf

@zephraph
Contributor

It's not quite clear to me how 309 casts doubt on the separation. We've always acknowledged that operators will need some lens into silo resources to debug escalated issues, but my understanding is that we still want to preserve the separation so that operators don't have unnecessary access to potentially confidential details.

#3092 is an example of where we're bending the boundaries a bit by making project/instance names visible to the operator. While this gives operators more insight into silo-specific resources, it does so to aid in establishing shared communication between the operator and the developer. I chose very specifically to display only the minimal context that the operator needs in this case.

@davepacheco
Collaborator Author

I understood #1340 (and my summary in this comment) to be proposing that Fleet Administrators would have no privileges to see just about anything inside other Silos: not the list of projects, instances, or anything like that. I agree that's too rigid, and RFD 309 explains why. This ticket is about figuring out what these boundaries really are and, ideally, what principles guide them: e.g., provide access when a holistic, cross-Silo view provides particular value, or where it's not even possible to correlate things without it. For example, if a Fleet Admin can see sleds but not instances, and a Silo Admin can see instances but not sleds, then literally nobody can tell you which instances are on which sleds.

@davepacheco
Collaborator Author

When we do this, we should re-evaluate:

@askfongjojo

askfongjojo commented Jan 31, 2025

A couple of us met today to try putting this long-standing open item to rest.
@davepacheco @smklein @inickles. Please feel free to make edits or additional comments as you see fit.

Here are the key determinations we've made (and I'll follow up with a short RFD that essentially collates these decisions with meeting notes, background info, related tickets to make them more visible/formal):

  1. Stick to the current model of strict tenancy separation as described in omicron#1340 for all silo resource APIs. Users with fleet-level roles (admin or viewer, referred to as "operators" below) will need to be explicitly granted the appropriate silo role if they want to view or act on resources in a given silo.
  2. For other use cases that allow a user to consume fleet-level information for audit or support purposes (e.g., someone in SecOps), we'll come up with new IAM roles that make the access highly constrained and explicit. This overrides the design stated in RFD 523 (audit logs) and RFD 496 (support bundles), which makes these fleet-wide artifacts accessible to operators by default. In other words, operators will get audit log and support bundle access only if they explicitly grant themselves the corresponding new roles, and users who hold only these new roles will not have access to other operator/system APIs or UI. We will not attempt to redact resource names or other potentially sensitive information (e.g., IP addresses) until this is mandated by customers.
  3. For the special case of enumerating resources residing on hardware components to determine maintenance or fault impact (today, the only such example is sled_instance_list but we may have similar APIs for disks and IP addresses in the future), we should use IDs everywhere as resource identifiers to avoid leaking any sensitive information conveyed in resource names.
  4. For any future use cases that present a strong need to allow operator access to silo resources, we'll likely enhance the IAM model to provide more fine-grained roles that govern what specific actions/attributes users can take or see for each resource.
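To make point 3 concrete, here is a minimal Rust sketch of an ID-only listing. The types and function shape are illustrative assumptions, not Omicron's actual API: the point is simply that the operator-facing view carries opaque UUIDs and omits user-chosen names entirely.

```rust
use std::collections::HashMap;

// Hypothetical operator-facing record (not Omicron's real type): every
// field is an opaque ID, so enumerating instances on a sled leaks no
// silo-owned information such as instance or project names.
#[derive(Debug, Clone, PartialEq)]
struct SledInstance {
    instance_id: &'static str, // UUID; the instance name is omitted
    project_id: &'static str,  // UUID; the project name is omitted
    silo_id: &'static str,     // UUID of the owning silo
}

// Return the instances on a given sled, identified by ID only.
fn sled_instance_list<'a>(
    inventory: &'a HashMap<&'static str, Vec<SledInstance>>,
    sled_id: &str,
) -> Vec<&'a SledInstance> {
    inventory
        .get(sled_id)
        .map(|v| v.iter().collect())
        .unwrap_or_default()
}

fn main() {
    let mut inv: HashMap<&'static str, Vec<SledInstance>> = HashMap::new();
    inv.insert(
        "sled-a1b2",
        vec![SledInstance {
            instance_id: "6a01c2de-0000-4000-8000-000000000001",
            project_id: "9f3e7b10-0000-4000-8000-000000000002",
            silo_id: "c4d51a22-0000-4000-8000-000000000003",
        }],
    );
    let on_sled = sled_instance_list(&inv, "sled-a1b2");
    assert_eq!(on_sled.len(), 1);
    // The operator sees only an ID; a silo admin can resolve it to a name.
    println!("{}", on_sled[0].instance_id);
}
```

A silo admin, who is authorized to see the resource names, would join these IDs back to names on their side; the fleet-level view never needs them.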

@david-crespo
Contributor

david-crespo commented Jan 31, 2025

This will be interesting for the web console. We only have system and silo sides. We'll have to either add a third section that only shows up for users with this role, or reuse the system section for users who hold only the audit role, in which case the only thing in the sidebar would be the thing they have access to.

@askfongjojo

I've put out RFD 550 to get broader visibility and feedback on the determinations. I think we can close this issue in favor of having follow-up discussions and next steps tracked in the RFD and its PR.
