Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Distributed] Correct handling ad-hoc requirements in non-final classes #80109

Open
wants to merge 3 commits into
base: main
Choose a base branch
from

Conversation

ktoso
Copy link
Contributor

@ktoso ktoso commented Mar 19, 2025

Reproduces #79318

The problem is that the protocol requirements for those "ad-hoc" requirements are declared as:

protocol DistributedTargetInvocationResultHandler {
  func onReturn<Success/*: SerializationRequirement*/>(value: Success) async throws

...

and then inside EmitPolymorphicParameters::injectAdHocDistributedRequirements we detect those few requirements that get this treatment, and we add the Success: SerializationRequirement constraints.

This works in debug mode, but as #79318 uncovered, it is not quite enough and will crash in release mode. Specifically, we add the witness information like this:

    IGF.setUnscopedLocalTypeData(
        archetypeTy,
        LocalTypeDataKind::forAbstractProtocolWitnessTable(proto),
        witnessTable);

which seems is not enough in release mode (?) or something gets optimized away, but could not figure out what exactly would even be expected to "be" there.

The crash manifests as an assertion about an invalid requirement which is because we're missed to add (or... lost somehow, due to some optimization?) the : Transferable the inject was adding. For reference, here's where we end up triggering the assertion:

Assertion failed: (!isInvalid()), function getRequirement, file ProtocolConformanceRef.cpp, line 52.
* thread #1, queue = 'com.apple.main-thread', stop reason = hit program assert
    frame #0: 0x0000000185223720 libsystem_kernel.dylib`__pthread_kill + 8
    frame #1: 0x000000018525bf70 libsystem_pthread.dylib`pthread_kill + 288
    frame #2: 0x0000000185168908 libsystem_c.dylib`abort + 128
    frame #3: 0x0000000185167c1c libsystem_c.dylib`__assert_rtn + 284
  * frame #4: 0x0000000105835dec swift-frontend`swift::ProtocolConformanceRef::getRequirement() const (.cold.1) at ProtocolConformanceRef.cpp:52:3 [opt]
    frame #5: 0x00000001018e3cec swift-frontend`swift::ProtocolConformanceRef::getRequirement(this=<unavailable>) const at ProtocolConformanceRef.cpp:52:3 [opt]
    frame #6: 0x00000001006b2104 swift-frontend`swift::irgen::emitWitnessTableRef(IGF=0x000000016fdf5248, srcType=CanType @ x19, srcMetadataCache=0x000000016fdf3500, conformance=ProtocolConformanceRef @ 0x000000016fdf3478) at GenProto.cpp:3779:28 [opt]
    frame #7: 0x00000001006b2d94 swift-frontend`swift::irgen::emitGenericRequirementFromSubstitutions(swift::irgen::IRGenFunction&, swift::GenericRequirement, swift::MetadataState, swift::SubstitutionMap, bool) [inlined] swift::irgen::emitWitnessTableRef(IGF=0x000000016fdf5248, srcType=CanType @ x20, conformance=<unavailable>) at GenProto.cpp:3771:10 [opt]
    frame #8: 0x00000001006b2d80 swift-frontend`swift::irgen::emitGenericRequirementFromSubstitutions(IGF=0x000000016fdf5248, requirement=GenericRequirement @ 0x000000016fdf3770, metadataState=Complete, subs=SubstitutionMap @ 0x000000016fdf3728, onHeapPacks=false) at GenProto.cpp:4135:12 [opt]
    frame #9: 0x00000001006b88f4 swift-frontend`void llvm::function_ref<void (swift::GenericRequirement)>::callback_fn<(anonymous namespace)::EmitPolymorphicArguments::emit(swift::SubstitutionMap, swift::irgen::WitnessMetadata*, swift::irgen::Explosion&)::$_0>(long, swift::GenericRequirement) [inlined] (anonymous namespace)::EmitPolymorphicArguments::emit(swift::SubstitutionMap, swift::irgen::WitnessMetadata*, swift::irgen::Explosion&)::$_0::operator()(this=0x000000016fdf3920, requirement=<unavailable>) const at GenProto.cpp:3935:7 [opt]
    frame #10: 0x00000001006b88dc swift-frontend`void llvm::function_ref<void (swift::GenericRequirement)>::callback_fn<(anonymous namespace)::EmitPolymorphicArguments::emit(swift::SubstitutionMap, swift::irgen::WitnessMetadata*, swift::irgen::Explosion&)::$_0>(callable=6171867424, params=<unavailable>) at STLFunctionalExtras.h:45:12 [opt]
    frame #11: 0x00000001006b8da8 swift-frontend`void llvm::function_ref<void (swift::GenericRequirement)>::callback_fn<(anonymous namespace)::PolymorphicConvention::enumerateUnfulfilledRequirements(llvm::function_ref<void (swift::GenericRequirement)> const&)::$_0>(long, swift::GenericRequirement) [inlined] llvm::function_ref<void (swift::GenericRequirement)>::operator()(this=<unavailable>, params=<unavailable>) const at STLFunctionalExtras.h:68:12 [opt]
    frame #12: 0x00000001006b8d8c swift-frontend`void llvm::function_ref<void (swift::GenericRequirement)>::callback_fn<(anonymous namespace)::PolymorphicConvention::enumerateUnfulfilledRequirements(llvm::function_ref<void (swift::GenericRequirement)> const&)::$_0>(long, swift::GenericRequirement) [inlined] (anonymous namespace)::PolymorphicConvention::enumerateUnfulfilledRequirements(llvm::function_ref<void (swift::GenericRequirement)> const&)::$_0::operator()(this=0x000000016fdf3950, requirement=GenericRequirement @ 0x000000016fdf37b0) const at GenProto.cpp:327:7 [opt]
    frame #13: 0x00000001006b8d1c swift-frontend`void llvm::function_ref<void (swift::GenericRequirement)>::callback_fn<(anonymous namespace)::PolymorphicConvention::enumerateUnfulfilledRequirements(llvm::function_ref<void (swift::GenericRequirement)> const&)::$_0>(callable=6171867472, params=<unavailable>) at STLFunctionalExtras.h:45:12 [opt]
    frame #14: 0x00000001006aa464 swift-frontend`swift::irgen::enumerateGenericSignatureRequirements(swift::CanGenericSignature, llvm::function_ref<void (swift::GenericRequirement)> const&) [inlined] llvm::function_ref<void (swift::GenericRequirement)>::operator()(this=0x000000016fdf3960, params=<unavailable>) const at STLFunctionalExtras.h:68:12 [opt]
    frame #15: 0x00000001006aa450 swift-frontend`swift::irgen::enumerateGenericSignatureRequirements(signature=CanGenericSignature @ 0x000000016fdf3850, callback=0x000000016fdf3960) at GenProto.cpp:309:11 [opt]
    frame #16: 0x00000001006b24a0 swift-frontend`swift::irgen::emitPolymorphicArguments(swift::irgen::IRGenFunction&, swift::CanTypeWrapper<swift::SILFunctionType>, swift::SubstitutionMap, swift::irgen::WitnessMetadata*, swift::irgen::Explosion&) [inlined] (anonymous namespace)::PolymorphicConvention::enumerateRequirements(this=0x000000016fdf38c0, callback=0x000000016fdf3960) at GenProto.cpp:320:10 [opt]
    frame #17: 0x00000001006b2494 swift-frontend`swift::irgen::emitPolymorphicArguments(swift::irgen::IRGenFunction&, swift::CanTypeWrapper<swift::SILFunctionType>, swift::SubstitutionMap, swift::irgen::WitnessMetadata*, swift::irgen::Explosion&) [inlined] (anonymous namespace)::PolymorphicConvention::enumerateUnfulfilledRequirements(this=0x000000016fdf38c0, callback=0x000000016fdf3938) at GenProto.cpp:325:3 [opt]
    frame #18: 0x00000001006b2480 swift-frontend`swift::irgen::emitPolymorphicArguments(swift::irgen::IRGenFunction&, swift::CanTypeWrapper<swift::SILFunctionType>, swift::SubstitutionMap, swift::irgen::WitnessMetadata*, swift::irgen::Explosion&) [inlined] (anonymous namespace)::EmitPolymorphicArguments::emit(this=0x000000016fdf38c0, subs=SubstitutionMap @ 0x000000016fdf3948, witnessMetadata=<unavailable>, out=<unavailable>) at GenProto.cpp:3933:3 [opt]
    frame #19: 0x00000001006b23e0 swift-frontend`swift::irgen::emitPolymorphicArguments(IGF=<unavailable>, origFnType=<unavailable>, subs=SubstitutionMap @ x21, witnessMetadata=0x000000016fdf3ac8, out=0x000000016fdf3b48) at GenProto.cpp:3923:45 [opt]
    frame #20: 0x000000010076da34 swift-frontend`(anonymous namespace)::IRGenSILFunction::visitFullApplySite(this=0x000000016fdf5248, site=FullApplySite @ 0x000000016fdf3d78) at IRGenSIL.cpp:3929:5 [opt]
    frame #21: 0x0000000100750538 swift-frontend`(anonymous namespace)::IRGenSILFunction::visitSILBasicBlock(swift::SILBasicBlock*) [inlined] swift::SILInstructionVisitor<(anonymous namespace)::IRGenSILFunction, void>::visit(this=0x000000016fdf5248, inst=0x00000001278a0180) at SILVisitor.h:0:5 [opt]
    frame #22: 0x00000001007504d8 swift-frontend`(anonymous namespace)::IRGenSILFunction::visitSILBasicBlock(this=<unavailable>, BB=<unavailable>) at IRGenSIL.cpp:2860:5 [opt]
    frame #23: 0x000000010074ecc0 swift-frontend`(anonymous namespace)::IRGenSILFunction::emitSILFunction(this=<unavailable>) at IRGenSIL.cpp:2720:5 [opt]
    frame #24: 0x000000010074c518 swift-frontend`swift::irgen::IRGenModule::emitSILFunction(this=0x000000016fdf5e40, f=0x000000015904bce0) at IRGenSIL.cpp:2568:30 [opt]
    frame #25: 0x00000001005fb620 swift-frontend`swift::irgen::IRGenerator::emitLazyDefinitions(this=0x000000016fdf8320) at GenDecl.cpp:1446:12 [opt]
    frame #26: 0x0000000100703350 swift-frontend`swift::IRGenRequest::evaluate(this=<unavailable>, evaluator=<unavailable>, desc=IRGenDescriptor @ 0x000000016fdf8bc8) const at IRGen.cpp:1297:11 [opt]
    frame #27: 0x000000010074b764 swift-frontend`swift::GeneratedModule swift::SimpleRequest<swift::IRGenRequest, swift::GeneratedModule (swift::IRGenDescriptor), (swift::RequestFlags)17>::callDerived<0ul>(this=<unavailable>, evaluator=<unavailable>, (null)=<unavailable>) const at SimpleRequest.h:287:24 [opt]
    frame #28: 0x000000010070a610 swift-frontend`swift::IRGenRequest::OutputType swift::Evaluator::getResultUncached<swift::IRGenRequest, swift::IRGenRequest::OutputType swift::evaluateOrFatal<swift::IRGenRequest>(swift::Evaluator&, swift::IRGenRequest)::'lambda'()>(this=0x000000015700c880, request=0x000000016fdf8db0, defaultValueFn=<unavailable>) at Evaluator.h:347:19 [opt]
    frame #29: 0x00000001007041c8 swift-frontend`swift::performIRGeneration(swift::ModuleDecl*, swift::IRGenOptions const&, swift::TBDGenOptions const&, std::__1::unique_ptr<swift::SILModule, std::__1::default_delete<swift::SILModule>>, llvm::StringRef, swift::PrimarySpecificPaths const&, llvm::ArrayRef<std::__1::basic_string<char, std::__1::char_traits<char>, std::__1::allocator<char>>>, llvm::GlobalVariable**) [inlined] swift::IRGenRequest::OutputType swift::Evaluator::operator()<swift::IRGenRequest, swift::IRGenRequest::OutputType swift::evaluateOrFatal<swift::IRGenRequest>(swift::Evaluator&, swift::IRGenRequest)::'lambda'(), (void*)0>(this=<unavailable>, request=0x000000016fdf8db0, defaultValueFn=<unavailable>) at Evaluator.h:235:12 [opt]
    frame #30: 0x00000001007041bc swift-frontend`swift::performIRGeneration(swift::ModuleDecl*, swift::IRGenOptions const&, swift::TBDGenOptions const&, std::__1::unique_ptr<swift::SILModule, std::__1::default_delete<swift::SILModule>>, llvm::StringRef, swift::PrimarySpecificPaths const&, llvm::ArrayRef<std::__1::basic_string<char, std::__1::char_traits<char>, std::__1::allocator<char>>>, llvm::GlobalVariable**) [inlined] swift::IRGenRequest::OutputType swift::evaluateOrFatal<swift::IRGenRequest>(eval=<unavailable>, req=IRGenRequest @ 0x000000016fdf8db0) at Evaluator.h:448:10 [opt]
    frame #31: 0x00000001007041bc swift-frontend`swift::performIRGeneration(M=0x0000000157847be8, Opts=<unavailable>, TBDOpts=0x0000000157821b38, SILMod=<unavailable>, ModuleName=<unavailable>, PSPs=<unavailable>, parallelOutputFilenames=(Data = Summary Unavailable, Length = 1), outModuleHash=0x000000016fdf98e0) at IRGen.cpp:1697:10 [opt]
    frame #32: 0x000000010027f7b0 swift-frontend`generateIR(IRGenOpts=0x0000000157821630, TBDOpts=0x0000000157821b38, SM=nullptr, PSPs=0x000000016fdf9fc0, OutputFilename=<unavailable>, MSF=<unavailable>, HashGlobal=0x000000016fdf98e0, parallelOutputFilenames=<unavailable>) at FrontendTool.cpp:1464:12 [opt]
    frame #33: 0x000000010027c48c swift-frontend`performCompileStepsPostSILGen(Instance=0x0000000157820200, SM=nullptr, MSF=swift::ModuleOrSourceFile @ 0x000000016fdf9920, PSPs=0x000000016fdf9fc0, ReturnValue=<unavailable>, observer=<unavailable>) at FrontendTool.cpp:1820:7 [opt]
    frame #34: 0x000000010027bc60 swift-frontend`swift::performCompileStepsPostSema(Instance=0x0000000157820200, ReturnValue=<unavailable>, observer=<unavailable>) at FrontendTool.cpp:761:12 [opt]

We also notice that this problem does not happen when the type is a struct, and it seems to be related with how the try_apply is made to the function because if we make (sorry example below has remoteCall, but it is the same issue as the onReturn) the call to such method using a function ref:

  %8 = function_ref @$s17DistributedSystemAAC10remoteCall2on6target10invocation8throwing9returningq0_x_0A006RemoteD6TargetVAA0jD7EncoderVzq_mq0_mtYaKAI0A5ActorRzs5ErrorR_AA12TransferableR0_AA18EndpointIdentifierV2IDRtzr1_lF : $@convention(method) @async <τ_0_0, τ_0_1, τ_0_2 where τ_0_0 : DistributedActor, τ_0_1 : Error, τ_0_2 : Transferable, τ_0_0.ID == EndpointIdentifier> (@guaranteed τ_0_0, @in_guaranteed RemoteCallTarget, @inout RemoteCallEncoder, @thick τ_0_1.Type, @thick τ_0_2.Type, @guaranteed DistributedSystem) -> (@out τ_0_2, @error any Error) // user: %9
  try_apply %8<τ_0_0, τ_0_1, τ_0_2>(%0, %1, %2, %3, %4, %5, %7) : $@convention(method) @async <τ_0_0, τ_0_1, τ_0_2 where τ_0_0 : DistributedActor, τ_0_1 : Error, τ_0_2 : Transferable, τ_0_0.ID == EndpointIdentifier> (@guaranteed τ_0_0, @in_guaranteed RemoteCallTarget, @inout RemoteCallEncoder, @thick τ_0_1.Type, @thick τ_0_2.Type, @guaranteed DistributedSystem) -> (@out τ_0_2, @error any Error), normal bb1, error bb2 // id: %9

We fail; because we're missing the Transferable type information. If the call is made as a class_method call like this (which happens in debug mode):

// protocol witness for DistributedActorSystem.remoteCall<A, B, C>(on:target:invocation:throwing:returning:) in conformance DistributedSystem
sil shared [transparent] [thunk] @$s4main17DistributedSystemC0B00b5ActorC0AadEP10remoteCall2on6target10invocation8throwing9returningqd_1_qd___AD06RemoteF6TargetV17InvocationEncoderQzzqd_0_mqd_1_mtYaKAD0bD0Rd__s5ErrorRd_0_2IDQyd__0dQ0Rtzr1_lFTW : $@convention(witness_method: DistributedActorSystem) @async <τ_0_0, τ_0_1, τ_0_2 where τ_0_0 : DistributedActor, τ_0_1 : Error, τ_0_0.ID == String> (@guaranteed τ_0_0, @in_guaranteed RemoteCallTarget, @inout RemoteCallEncoder, @thick τ_0_1.Type, @thick τ_0_2.Type, @in_guaranteed DistributedSystem) -> (@out τ_0_2, @error any Error) {
...
[global: read,write,copy,destroy,allocate,deinit_barrier]
...
bb0(%0 : $*τ_0_2, %1 : $τ_0_0, %2 : $*RemoteCallTarget, %3 : $*RemoteCallEncoder, %4 : $@thick τ_0_1.Type, %5 : $@thick τ_0_2.Type, %6 : $*DistributedSystem):
  %7 = load %6                                    // users: %8, %9
  %8 = class_method %7, #DistributedSystem.remoteCall : <Actor, Err, Res where Actor : DistributedActor, Err : Error, Res : Transferable, Actor.ID == String> (DistributedSystem) -> (Actor, RemoteCallTarget, inout RemoteCallEncoder, Err.Type, Res.Type) async throws -> Res, $@convention(method) @async <τ_0_0, τ_0_1, τ_0_2 where τ_0_0 : DistributedActor, τ_0_1 : Error, τ_0_2 : Transferable, τ_0_0.ID == String> (@guaranteed τ_0_0, @in_guaranteed RemoteCallTarget, @inout RemoteCallEncoder, @thick τ_0_1.Type, @thick τ_0_2.Type, @guaranteed DistributedSystem) -> (@out τ_0_2, @error any Error) // user: %9
  try_apply %8<τ_0_0, τ_0_1, τ_0_2>(%0, %1, %2, %3, %4, %5, %7) : $@convention(method) @async <τ_0_0, τ_0_1, τ_0_2 where τ_0_0 : DistributedActor, τ_0_1 : Error, τ_0_2 : Transferable, τ_0_0.ID == String> (@guaranteed τ_0_0, @in_guaranteed RemoteCallTarget, @inout RemoteCallEncoder, @thick τ_0_1.Type, @thick τ_0_2.Type, @guaranteed DistributedSystem) -> (@out τ_0_2, @error any Error), normal bb1, error bb2 // id: %9

This works because it seems the first thing the class method does is "get" that type information using a conformsToProtocol...

Not really sure what to explore further here. Key finding is that this is specifically a release mode issue, and that we're then missing the t_0_2: Transferable bit that our injectAdHocDistributedRequirements would have added. What could be different here in a release mode?

rdar://146101172

@ktoso ktoso force-pushed the wip-reproduce-146101172 branch from e8485c7 to a0ccbcb Compare March 25, 2025 10:59
@ktoso ktoso force-pushed the wip-reproduce-146101172 branch 2 times, most recently from d8dba5d to b4ff64e Compare March 25, 2025 13:00
@ktoso
Copy link
Contributor Author

ktoso commented Mar 25, 2025

@swift-ci please smoke test
@swift-ci please smoke test

@ktoso ktoso added the distributed Feature → concurrency: distributed actor label Mar 26, 2025
@ktoso ktoso force-pushed the wip-reproduce-146101172 branch from b4ff64e to 3c88726 Compare March 26, 2025 09:54
@ktoso ktoso changed the title [Distributed] Reproduce conformance issue for further investigation [Distributed] Correct handling ad-hoc requirements in non-final classes Mar 26, 2025
@ktoso
Copy link
Contributor Author

ktoso commented Mar 26, 2025

@swift-ci please smoke test

@ktoso
Copy link
Contributor Author

ktoso commented Mar 26, 2025

With some hints from @slavapestov I moved the witness generic signature fixing into lib/AST/RequirementEnvironment.cpp. This seems pretty good so far...

However we have two very mysterious failures remaining:


Failed Tests (2):
  Swift(macosx-arm64) :: Distributed/Runtime/distributed_actor_generic_codable_da_conformance.swift
  Swift(macosx-arm64) :: Distributed/Runtime/distributed_actor_remoteCall_protocol_method_in_presence_of_multiple_systems.swift

these are weird because the test is not very different than other remoteCall ones... And especially, the -O one: Distributed/Runtime/distributed_actor_remoteCall_roundtrip_optimized_or_not.swift works under this new change... Those tests which do pass do exercise the encoder, but can't seem to pinpoint what's different with these two, must be the conformance after all somehow?

The failures of the two above are pretty mysterious because the backtrace is "nothing" so we must have messed up something real bad:

(lldb) target create "/Users/ktoso/code/swift-project/build/Ninja-DebugAssert/swift-macosx-arm64/test-macosx-arm64/Distributed/Runtime/Output/distributed_actor_remoteCall_protocol_method_in_presence_of_multiple_systems.swift.tmp/a.out"
Current executable set to '/Users/ktoso/code/swift-project/build/Ninja-DebugAssert/swift-macosx-arm64/test-macosx-arm64/Distributed/Runtime/Output/distributed_actor_remoteCall_protocol_method_in_presence_of_multiple_systems.swift.tmp/a.out' (arm64).
(lldb) run
Process 95890 launched: '/Users/ktoso/code/swift-project/build/Ninja-DebugAssert/swift-macosx-arm64/test-macosx-arm64/Distributed/Runtime/Output/distributed_actor_remoteCall_protocol_method_in_presence_of_multiple_systems.swift.tmp/a.out' (arm64)
| assign id: ActorAddress(address: "<unique-id>") for DAFR
| actor ready: main.DAFR
| resolve ActorAddress(address: "<unique-id>") as remote // this system always resolves as remote
resolved on main.FakeRoundtripActorSystem: $GreeterProtocol<FakeRoundtripActorSystem>
 > encode generic sub: main.$GreeterProtocol<main.FakeRoundtripActorSystem>
 > encode return type: Swift.String
 > done recording
  >> remoteCall: on:main.$GreeterProtocol<main.FakeRoundtripActorSystem>, target:main.$GreeterProtocol.greet(), invocation:FakeInvocationEncoder(genericSubs: [main.$GreeterProtocol<main.FakeRoundtripActorSystem>], arguments: [], returnType: Optional(Swift.String), errorType: nil), throwing:Swift.Never, returning:Swift.String
 > execute distributed target: main.$GreeterProtocol.greet(), identifier: $s4main16$GreeterProtocolC5greetSSyYaKFTE
  > decode generic subs: [main.$GreeterProtocol<main.FakeRoundtripActorSystem>]
  > decode return type: Swift.String
 << onReturn: DAFR
  << remoteCall return: DAFR
Process 95890 stopped
* thread #2, queue = 'com.apple.root.default-qos.cooperative', stop reason = EXC_BAD_ACCESS (code=1, address=0x0)
    frame #0: 0x0000000000000000
error: memory read failed for 0x0
Target 0: (a.out) stopped.
(lldb) bt
* thread #2, queue = 'com.apple.root.default-qos.cooperative', stop reason = EXC_BAD_ACCESS (code=1, address=0x0)
  * frame #0: 0x0000000000000000

The other one:

Process 643 launched: '/Users/ktoso/code/swift-project/build/Ninja-DebugAssert/swift-macosx-arm64/test-macosx-arm64/Distributed/Runtime/Output/distributed_actor_remoteCall_protocol_method_in_presence_of_multiple_systems.swift.tmp/a.out' (arm64)
| assign id: ActorAddress(address: "<unique-id>") for DAFR
| actor ready: main.DAFR
| resolve ActorAddress(address: "<unique-id>") as remote // this system always resolves as remote
resolved on main.FakeRoundtripActorSystem: $GreeterProtocol<FakeRoundtripActorSystem>
 > encode generic sub: main.$GreeterProtocol<main.FakeRoundtripActorSystem>
 > encode return type: Swift.String
 > done recording
  >> remoteCall: on:main.$GreeterProtocol<main.FakeRoundtripActorSystem>, target:main.$GreeterProtocol.greet(), invocation:FakeInvocationEncoder(genericSubs: [main.$GreeterProtocol<main.FakeRoundtripActorSystem>], arguments: [], returnType: Optional(Swift.String), errorType: nil), throwing:Swift.Never, returning:Swift.String
 > execute distributed target: main.$GreeterProtocol.greet(), identifier: $s4main16$GreeterProtocolC5greetSSyYaKFTE
  > decode generic subs: [main.$GreeterProtocol<main.FakeRoundtripActorSystem>]
  > decode return type: Swift.String
 << onReturn: DAFR
  << remoteCall return: DAFR
Process 643 stopped
* thread #2, queue = 'com.apple.root.default-qos.cooperative', stop reason = EXC_BAD_ACCESS (code=2, address=0x16fe86e30)
    frame #0: 0x000000016fe86e30
->  0x16fe86e30: udf    #0x0
    0x16fe86e34: udf    #0x0
    0x16fe86e38: udf    #0x0
    0x16fe86e3c: udf    #0x0
Target 0: (a.out) stopped.
(lldb) bt
* thread #2, queue = 'com.apple.root.default-qos.cooperative', stop reason = EXC_BAD_ACCESS (code=2, address=0x16fe86e30)
  * frame #0: 0x000000016fe86e30
    frame #1: 0x000000016fe86e30

The crashes are pretty nasty, as if we messed up where we jump - so the witnesses are somehow messed up, but unclear why only in those two tests, while other remoteCall roundtrip ones seemed fine hm. Stepping through the runtime bits of remoteCall etc isn't very informative sadly.

Slava also suggested we may need to remove the newly added requirements from IR, but I'm not quite sure where to do that.

Removing the setUnscopedLocalTypeData part we do in EmitPolymorphicParameters::injectAdHocDistributedRequirements results in a hang that I'm yet to debug, but somewhat makes me thing we'll want to keep this part.

@xedin @mikeash maybe you have some ideas what we might need to look into here, if you'd skim the change -- primarily inside the RequirementEnvironment?

@@ -762,14 +762,11 @@ struct FakeInvocationEncoder_recordArgument_wrongType: DistributedTargetInvocati
mutating func doneRecording() throws {}
}
struct FakeInvocationEncoder_recordArgument_missingMutating: DistributedTargetInvocationEncoder {
//expected-error@-1{{type 'FakeInvocationEncoder_recordArgument_missingMutating' does not conform to protocol 'DistributedTargetInvocationEncoder'}}
//expected-error@-1{{struct 'FakeInvocationEncoder_recordArgument_missingMutating' is missing witness for protocol requirement 'recordArgument'}}
Copy link
Contributor Author

@ktoso ktoso Mar 26, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I was debugging why the hell only the recordArgument changed behavior here and could not figure it out so far... all the checks are the same after all hm. May be a clue, I don't know yet.

This isn't wrong though, we still do miss the witness as expected after all -- since it has missing the mutating and logic to detect that still works after all hm

@ktoso
Copy link
Contributor Author

ktoso commented Mar 26, 2025

Ok even weirder, the Linux build fails (but is fine on macOS) like


/home/build-user/swift/stdlib/public/Distributed/LocalTestingDistributedActorSystem.swift:164:15: error: struct 'LocalTestingInvocationEncoder' is missing witness for protocol requirement 'recordReturnType'
162 | 
163 | @available(SwiftStdlib 5.7, *)
164 | public struct LocalTestingInvocationEncoder: DistributedTargetInvocationEncoder {
    |               |- error: struct 'LocalTestingInvocationEncoder' is missing witness for protocol requirement 'recordReturnType'
    |               `- note: add stubs for conformance
165 |   public typealias SerializationRequirement = Codable
166 | 

which did build fine on macOS, which makes me wonder what's up here... definitely more digging required.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
distributed Feature → concurrency: distributed actor
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant