Callback-based gRPC client with C interface #963

Open
cretz wants to merge 4 commits into master

Conversation

@cretz (Member) commented Jul 23, 2025

What was changed

  • Added temporal_client::callback_based module with a Tonic-compatible Tower service implementation that invokes a callback instead of making a network call (a rough sketch of the idea follows this list)
  • Added a connect_no_namespace_with_service_override overload that accepts an optional callback service (and moved the existing logic from connect_no_namespace into it)
  • Adapted temporal_client::metrics::GrpcMetricSvc to work with channel or callback-based service
  • Added grpc_override_callback option in C bridge's ClientOptions that can be set with a C function for callback
  • Added supporting structures and methods for C-based callbacks, with a careful eye toward lifetimes
  • Added a minimal test to confirm some behaviors
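
A rough sketch of the callback-based service idea described in the first bullet (not the PR's actual code: the names, the plain Vec<u8> bodies, and the synchronous callback type are simplifications):

use std::{
    future::Future,
    pin::Pin,
    sync::Arc,
    task::{Context, Poll},
};

use tower::Service;

// Hypothetical callback type: takes the whole request and produces a response.
type GrpcCallback = dyn Fn(http::Request<Vec<u8>>) -> http::Response<Vec<u8>> + Send + Sync;

#[derive(Clone)]
struct CallbackService {
    callback: Arc<GrpcCallback>,
}

impl Service<http::Request<Vec<u8>>> for CallbackService {
    type Response = http::Response<Vec<u8>>;
    type Error = Box<dyn std::error::Error + Send + Sync>;
    type Future = Pin<Box<dyn Future<Output = Result<Self::Response, Self::Error>> + Send>>;

    fn poll_ready(&mut self, _cx: &mut Context<'_>) -> Poll<Result<(), Self::Error>> {
        // There is no connection to establish, so the service is always ready.
        Poll::Ready(Ok(()))
    }

    fn call(&mut self, req: http::Request<Vec<u8>>) -> Self::Future {
        let cb = self.callback.clone();
        // Instead of dialing the network, hand the request to the callback.
        Box::pin(async move { Ok(cb(req)) })
    }
}

A boxed error type along these lines is presumably what motivates the GrpcMetricSvc error-type change shown in the diff further down.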

Missing/future features:

  • Cancellation support - There is currently no way to notify the callback implementer when a call needs to be canceled, which is an important feature
  • Easy way to delegate to traditional Core client from inside callback - This is needed to be able to use the callback-based client as an interception mechanism for langs

@cretz requested a review from a team as a code owner, July 23, 2025 21:44
 impl Service<http::Request<Body>> for GrpcMetricSvc {
     type Response = http::Response<Body>;
-    type Error = tonic::transport::Error;
+    type Error = Box<dyn std::error::Error + Send + Sync>;

@cretz (Member, Author):

I do not believe changing this will cause any issues or serious performance concerns, but would like to have it double checked

@Sushisource (Member) left a comment:

In general looking good!

let req = GrpcRequest {
    service: path_parts.next().unwrap_or_default(),
    rpc: path_parts.next().unwrap_or_default(),
    headers: &parts.headers,

@Sushisource (Member):

Why manipulate the body at all? If we pass through the compression flag and length, we can document that and allow the callback implementer to handle compression if they want

@cretz (Member, Author) commented Jul 24, 2025:

Because most gRPC clients deal in protos, not full bodies with the extra 5 bytes in front. For callback implementers that are, say, delegating to their own in-language clients, those clients operate on protobuf bodies, not raw HTTP ones. I don't think we should ask them to carve up the body to get the proto out (or put it back). If there is a use case for needing to know the compression byte, we can add it, but we are in control of the client call, so it will always be what we want anyway.
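
For context, the "extra 5 bytes in front" is the standard gRPC length-prefixed message framing: one compression-flag byte followed by a four-byte big-endian message length. A minimal sketch of stripping and re-adding that prefix (illustrative names, not the PR's code):

/// Strip the 5-byte gRPC message prefix (1 compression-flag byte + 4-byte
/// big-endian length) to recover the raw protobuf message bytes.
fn strip_grpc_prefix(body: &[u8]) -> Option<&[u8]> {
    if body.len() < 5 {
        return None;
    }
    let _compressed = body[0]; // 0 = uncompressed, which is all the in-memory path produces
    let len = u32::from_be_bytes([body[1], body[2], body[3], body[4]]) as usize;
    body.get(5..5 + len)
}

/// Re-wrap protobuf message bytes with an uncompressed gRPC prefix.
fn add_grpc_prefix(proto: &[u8]) -> Vec<u8> {
    let mut out = Vec::with_capacity(proto.len() + 5);
    out.push(0); // compression flag: not compressed
    out.extend_from_slice(&(proto.len() as u32).to_be_bytes());
    out.extend_from_slice(proto);
    out
}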

@Sushisource (Member):

Right, what I'm saying is the callback client isn't really a pure gRPC client, because it may want to delegate to something that implements compression too. So it seems to me we should just document that and leave it to the caller.

@cretz (Member, Author) commented Jul 25, 2025:

They can't enable compression; we'd have to, with our Tonic client, which is what builds this body. But there's no value for us to do so in this case since it's all in memory. If they want to delegate to something that implements compression, they can/should. They can wrap the whole proto bytes in pre-negotiated compression if they'd like. But from our in-memory perspective, we give them proto bytes; how that's represented on the in-memory wire via the few bits of Rust code between Tonic and this callback should have no user effect. It's up to us, and it should always be 0 (no compression) IMO. The compression byte is about a pre-negotiated compression algorithm with the server, which doesn't apply in-memory, nor should it.

Overall, this compression flag is just a boolean used by gRPC over HTTP, but it has no value for the in-memory representation and will always be 0. It doesn't compress the bytes or even tell you the algorithm; it's just a note saying the compression negotiated with the server is in effect (there is no server here). This is unrelated to whether a user wants to use compression with their upstream implementation.

@Sushisource (Member):

Ahh, ok, that makes sense.

Comment on lines +170 to +176
// We have to cast this to a literal pointer integer because we use spawn_blocking
// and Rust can't validate things in either of two approaches. The first approach,
// just moving the *mut into the spawn_blocking closure, will not work because it is
// not Send (even if you wrap it in a marked-Send struct). The second approach, moving
// the box into the closure and into_raw'ing it there, won't work because Rust thinks
// the "req" param to spawn_blocking may outlive this closure, even though we're
// confident that with our oneshot use this will never happen.
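
A minimal sketch of the pattern this comment describes, with hypothetical names (the real code passes a richer request and completes the call through a oneshot in the respond function):

use tokio::task::spawn_blocking;

struct CallbackRequest {
    proto: Vec<u8>,
}

// A raw pointer is not Send, so it cannot be moved into spawn_blocking directly;
// erasing it to a usize sidesteps that, at the cost of manual safety reasoning.
async fn hand_to_callback(req: Box<CallbackRequest>, cb: extern "C" fn(*mut CallbackRequest)) {
    let req_ptr = Box::into_raw(req) as usize;
    spawn_blocking(move || {
        // SAFETY: the pointer came from Box::into_raw above; the callback (via its
        // respond function) is responsible for reclaiming it exactly once.
        cb(req_ptr as *mut CallbackRequest);
    })
    .await
    .expect("callback task failed");
}
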
@Sushisource (Member):

This makes sense to me. AFAIK there is no safe way to express this (if you want to keep using spawn_blocking).

Because spawn_blocking by definition requires the possibility of using another thread, the Send requirement exists. Raw pointers are never send. The lifetime issue is also unsolvable: https://without.boats/blog/the-scoped-task-trilemma/

This explanation is great, but I'd also like a quick summary along the lines of // SAFETY: This is safe because the spawned task is guaranteed to be joined, and the box reclaimed, before this function exits.

However, writing that - is it actually safe in the error case? Seems like we might need to double check there that the user did call the response callback, and free the pointer if they didn't.

@cretz (Member, Author) commented Jul 24, 2025:

> Raw pointers are never send

The Rustonomicon says you can cheat this (https://doc.rust-lang.org/nomicon/send-and-sync.html), but it still was not working for me when I tried due to other issues (but I have cheated like this before).

> However, writing that - is it actually safe in the error case? Seems like we might need to double check there that the user did call the response callback, and free the pointer if they didn't.

If they didn't call the respond call, receiver.await never completes. And note, the only time the sender can ever be dropped is also in that same respond call.

@Sushisource (Member):

Right, but the error can get returned before that await point.

@cretz (Member, Author) commented Jul 25, 2025:

Hrmm, so from my reading, in this situation spawn_blocking may return an error not caused by the user callback code panicking when 1) shutting down the tokio runtime (is_cancelled) or 2) tokio cannot schedule the thread (a form of panic). Both should be rare, but I suppose technically possible. I will look into making an atomic free call on the error return there.

EDIT: Actually, I'm struggling to know whether it reached user code or not. We definitely don't want to free if it did, right? What if the user callback did the respond and then panicked (e.g. threw an exception from their lang)? That would be a second free attempt, but the memory would already be invalid, so you can't check whether it was freed before. I can have some kind of send/arc bool, I guess, that I set just before invoking the user callback so we know their code is responsible for freeing from that point on.

@Sushisource (Member):

Well, that's the thing: you can't necessarily know (I agree it's very rare, though). Having some bool flag that gets set when the response is called is what I was thinking.
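
A sketch of how such a flag might look, extending the earlier pointer-erasure sketch (hypothetical names; here the flag is set just before user code is invoked, so an error from spawn_blocking can tell whether freeing is still our responsibility):

use std::sync::{
    atomic::{AtomicBool, Ordering},
    Arc,
};

use tokio::task::spawn_blocking;

struct CallbackRequest {
    proto: Vec<u8>,
}

async fn hand_to_callback(req: Box<CallbackRequest>, cb: extern "C" fn(*mut CallbackRequest)) {
    let handed_off = Arc::new(AtomicBool::new(false));
    let handed_off_in_task = handed_off.clone();
    let req_ptr = Box::into_raw(req) as usize;

    let result = spawn_blocking(move || {
        // From here on, user code (via its respond call) owns the request.
        handed_off_in_task.store(true, Ordering::SeqCst);
        cb(req_ptr as *mut CallbackRequest);
    })
    .await;

    if result.is_err() && !handed_off.load(Ordering::SeqCst) {
        // The task never reached user code (e.g. runtime shutdown), so the box is
        // still ours to reclaim.
        // SAFETY: the pointer came from Box::into_raw above and was never handed off.
        unsafe { drop(Box::from_raw(req_ptr as *mut CallbackRequest)) };
    }
}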
