First, thanks for the very kind regards. And solving problems with CF is what I love to do, and I do it about all day each day...albeit normally on a consulting basis. But I bring that experience to bear here in the community, especially on such knotty problems (we have a couple going on here in the forums this week).
1) So for anyone who may look at Matt's log attachment that he shared, note that it's not ONE log. Instead he shareds a bit of a couple of CF runs, then adds some commentary (separated by -------------- lines to help), and then more CF runs, then more comments. That wouldn't be obvious to some, who may just start "searching" the log for info and perhaps would not see the context he offers there.
2) I confirmed that the uninstalling only shows up for the one startup after he had applied updated. Good thing I was careful to couch my proposal above (that he might find package updates happening on each start) as merely a possibility. 🙂
3) So what can we take away from all the info he's shared? Well, I don't see any ready explanation in CF's log lines. But he says here and a bit more in the log text that the last problem happened after a box restart...which they'd done because of a memory problem in CF (with I suspect high cpu). That may be were the problem lies. More on that in a moment.
4) I'll note first that you could have avoided the box reboot by just killing the CF task. I do realize you likely tried to stop the service (which would "shut it down gracefully") but you probably found it "wouldn't stop". That happens when memory runs out, GCs get excessive, and CF CPU takes over most of the machine.
And since I can see from the logs you're on Windows, I suspect further that you either gave up waiting for the progress bar of the service to finish, or it popped up saying it couldn't be started. Here's good news in that situation: if you'd waited just another minute then CF would have killed the process. But perhaps you didn't want to wait: in that case, you could have killed the coldfusion.exe process yourself in task manager (not something to do regularly, for reasons I'm about to explain).
But FWIW, when you opted to reboot the box, the same effect could have happened: Windows again tells all services, "hey y'all. shut down." But if they don't, then IT just kills the whole running OS.
5) Now, what's this got to do with your scenario? Well, maybe somehow when CF doesn't go down gracefully (is killed by you, or by Services not being able to shut it down gracefully, or by a box reboot), perhaps THAT is when something goes amiss with the felix-cache. This is just a guess.
I just don't know enough about how the felix-cache internal processing works. Perhaps someone else or Adobe could chime in. (It's not clear they are reading this thread or will. You may want to open a ticket at tracker.adobe.com reporting the issue. Let us know the ticket number if you do.)
6) If this is indeed the issue, then that's a hard thing to "monitor" (going back to your original idea). There's no mechanism I know of that tracks if CF did or did not shut down gracefully in its previous run.
You could "watch out for the problem" manually: if you look at that coldfusion-out.log, you'll notice you don't see lines about it "stopping", like you showed when it "went down normally", for example (from your log):
Jul 10, 2025 20:15:59 PM Information [Thread-12] - Monitoring Service stopped.
Jul 10, 2025 20:15:59 PM Information [FelixStartLevel] - Monitoring Service stopped.
Jul 10, 2025 20:15:59 PM Information [Thread-12] - ColdFusion stopped
So if I ever DO wonder if someone/something "killed" CF, this is what I look for. Or if I'm reviewing logs with folks and I see it does NOT log that (in coldfusion-outl.log, or similar but different lines in coldfusion-error.log, such as "A valid shutdown command was received via the shutdown port. Stopping the Server instance"), that tells me CF crashed or was killed (and usually I'd want to find out why).
I'm just saying that's ONE thing you could watch out for. Sadly I'm not aware of even any means to programatically tell how long CF has been up (so that you could watch for when it may have recently "started", then look to see if those log lines are missing--in which case you could proactively stop CF and clear the felix-cache and start it again.)
But I will note at least that if you have FusionReactor, that DOES show on its metrics<web metrics page (in the text below the graphs) when CF "started" (it refers to it as "server started" but it IS referring to CF, or Lucee, or BoxLang, or Tomcat, or Wildfly, or whatever Java app.app server FR is monitoring--which is why they just say "server", though many misread that as "the server".)
Sorry I don't have better news. Again, maybe someone else will, or Adobe may get back to you if you file a ticket. Please do keep us posted. I've not heard of this happening to many others, but it certainly COULD be happening to others. I hoep we may find some "better" explanation for your situation, as it would be lamentable if a mere "hard crash" of CF could lead to this need to clear the felix-cache. I actually hope it's proven that it's NOT the case, actually.
... View more