I have the following situation. I had a function running on a Windows App Service Plan that processed blobs, using the default LogsAndContainerScan blob trigger. After some time I decided to rewrite this function and migrate it from Windows to Linux, deploying it in an isolated environment inside a Docker container. To accomplish this I created another Function App running on a new App Service Plan for Linux. During the deployment I deployed and started the new function app on Linux and stopped the old one on Windows.
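For context, the trigger in question is a standard blob trigger binding. A minimal sketch of what such a binding looks like in function.json (container name and binding name here are hypothetical, not taken from my actual app):

```json
{
  "bindings": [
    {
      "name": "inputBlob",
      "type": "blobTrigger",
      "direction": "in",
      "path": "input-container/{name}",
      "connection": "AzureWebJobsStorage"
    }
  ]
}
```

With no `source` property set, the runtime uses the default LogsAndContainerScan mechanism (polling plus log scanning) rather than `"source": "EventGrid"`.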
To my big surprise, the new function started to process blobs that had been processed long ago by the previous function. After some digging and reading answers on Stack Overflow, for example this one or this one, it seems that the function will only process a blob if there is no blob receipt for it inside the azure-webjobs-hosts blob container. When I looked at my azure-webjobs-hosts container, I found that there are actually two folders in there: one for my previous function app and one for my new one. So I conclude that even though receipts existed for the old blobs, they were in the folder of the old function app. When I created the new function app, it looked for receipts in its own folder, couldn't find any, and therefore started to process all of the blobs again. Which basically means that whenever I create another function app with a blob trigger on the same container, it will try to reprocess all of the existing files.
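To make my reasoning concrete, here is a small sketch of how I understand the receipt lookup to work. The exact path layout is my approximation of what gets written under azure-webjobs-hosts, and the host IDs, function name, and blob name are all hypothetical:

```python
# Sketch of why the new app reprocessed everything: blob receipts appear to be
# stored under a prefix derived from the host ID of the function app that wrote
# them, so a new app with a different host ID never finds the old receipts.

def receipt_path(host_id: str, function_name: str, etag: str, blob_path: str) -> str:
    """Approximate layout of a blob receipt inside azure-webjobs-hosts."""
    return f"blobreceipts/{host_id}/{function_name}/{etag}/{blob_path}"

# A receipt written by the old (Windows) app lives under its host ID...
old_receipt = receipt_path("oldwindowsapp123", "Host.Functions.ProcessBlob",
                           "0x8D9ETAG", "input-container/file.csv")

# ...but the new (Linux) app derives a different host ID, so it searches a
# different prefix, finds no receipt, and treats every existing blob as new.
new_prefix = "blobreceipts/newlinuxapp456/"

print(old_receipt.startswith(new_prefix))  # → False
```

This matches what I saw in the container: two sibling folders, each holding receipts only for its own function app.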
The questions that I have:
- Is my reasoning above correct, and does every new function app reprocess blobs that were already processed before? If not, why did it happen in my situation?
- Is there any way to avoid this situation in the future when, for example, I decide to create yet another function app that operates on the same blob container?