GCode stopping (controller crash?), other issue making the machine unusable

Bottom line: New build - having lots of trouble diagnosing weird communication/electronic/software/controller issues that are making the machine unusable even as a pen-plotter. Losing motivation…

Setup:

*LR3 with 750mmX x 1500mmY rails, all PLA plates (for now)

*Ryan’s hardware kit (had to replace defective X stepper but otherwise all parts from the kit)

*Jackpot controller, preloaded firmware from Ryan - according to the boot message FluidNC 3.8.0

*Running in AP mode (experimenting with different devices to run interface, including chrome on Windows 10 Pro (Suface) and Kindle tablet)

*Had modified the stepoffs and steps/mm in the config, but reverted to a copy I had made pre-changes to Ryan’s original config.yaml to try to rule this out

*Running GCode uploaded to SD, making GCode with Estlcam

*I haven’t cut strut plates yet and (full disclosure) waiting to tidy up wiring until I’ve got those.

The issue(s):

*The biggest problem that makes the machine pretty much worthless until resolved is intermittently/unpredictably stopping (software/board crashing??) while running GCode.

*The machine responds to jog commands through the web interface sometimes instantly, sometimes delayed (seconds), sometimes not at all.

*The resume and “clear alarm” buttons never work on the FluidNC ESP3D interface. (This is problematic as it makes it more difficult to get past status messages, ie. where they are included in recommended probing scripts, tool changes, etc).

*When I run code and hit pause it does pause (on tablet mode, doesn’t recognize anything on the main interface page and keeps on running), but I can not resume at all and then nothing responds (won’t even reset the Jackpot controller, have to manually disconnect and reconnect power to get control again).

*When the controller hangs and I have to hard reset with the ESP buttons or disconnecting power I sometimes get “ERROR:152 Configuration is invalid. Check boot messages for ERR’s” but there are no errors listed in the boot messages, and I reverted to the copy I had made of the original config and am still getting this error here and there.

For clarity, these issues occur without the router plugged in (so not EM issues from that).

Some things I’ve tried:

*Reverting to original config.yaml (although I’m pretty sure some issues like the problems jogging where noted before I made changes).

*Watched for patterns for a triggering event when the GCode running hangs - can’t find a rhyme or reason.

*Ruled out endstop switch connections (or interference from steppers causing false positives) as triggering the machine halting (connected and reconnected and clicked them while a job is running - this doesn’t interrupt the program).

*Other stuff but my brain is fried at this point and I can’t recall what else I’ve done/ruled out…

Some of this stuff seems like EM interference issues (but I think not actually likely since there isn’t a lot of connected metal and router isn’t plugged in), some seems like web interface issues (resume and clear alarm buttons not responding to clicking) and some just feels like a firmware or controller board problem.

I am nearing my wits end. Not finding any posts/Google results for people struggling with the same stuff. Any pointers happily accepted. Hopefully I’m missing something simple and maybe obvious to someone out there.

Build Photos (plan to tidy wiring after I get this thing reliable enough to make/install the strut plates, unless someone thinks the wiring may legitimately be the issue)

Hey there! Welcome!

I am far, far from an expert here. I usually try to jump in and say the usual suspects so the real geniuses can take it from there if it doesn’t work still.

Looks to me like you have a lot of wires going across the esp32 module. I think in the past Ryan and a lot of others have recommended tidying that up when you have weird issues like what you are describing. Wires passing over the silver part of the ESP32 is not the best, if I’m remembering correctly. Not sure if this will fix your problem. But maybe worth a try?

Also, maybe want to post a copy of your gcode. I doubt that’s an issue with what you are describing. But again, one of the other guys may feel differently.

2 Likes

There are currently some issues being worked on with 3.8.x. It is probably worthwhile to downgrade to 3.7.17.

2 Likes

Zach, thanks for jumping in!

I’ll move the wires by the ESP and see if anything improves! Thanks for the specific suggestion!

Thanks Jason! I’ll try and do that tonight!

Yes, I stopped in to say the same thing that Jason just noted.
There are some architectural things in FluidNC where improvements in Espresif support libraries and things intended to be FluidNC enhancments have backfired a bit. (This is being worked on.)

You are best served going back to 3.7.17 with a full erase and reload.
Let us know if you need help doing that.

Once you’re on 3.7.17, let’'s get a fresh assessment.
There are some things on your list that I see which might hint at at some other potential issues, but we need to get you back to a known good FluidNC first.

1 Like

Thanks everyone for the help so far!

Update…

Fixed that.

Done with that. Did a full erase and running 3.7.17 now. Ran into a couple snags but nothing too crazy.

I haven’t had a chance to do much testing yet to see what is fixed and what isn’t, but initially it does seem like the buttons on the interface that weren’t working before are now! There is still more lag on some (~30% of clicks) than I would expect before it executes a jog. Is it normal to click the button on the web interface and sometimes it takes a couple seconds before the machine responds or should it be pretty responsive feeling as long as there is a solid connection?

I feel like I’m not out of the woods yet, but it seems like some progress was made with the pointers so far.

I’m in EU and have to get myself to put this on pause for the night. I’ll test more thoroughly tomorrow after work and update you all.

Sincere thanks for all the help already.

2 Likes

@Sabotage37 @MakerJim @jeyeager

In addition to cleaning up the wiring and “downgrading” (feels like an upgrade) the FluidNC last night, tonight I went back to the original yaml and manually put in my needed changes (for steps/mm and offsets) as I wanted to rule out that anything got wonky by making changes through the web interface.

I have got to do a little testing tonight, and it seems like everything is working! I need to test and play around with it more to 100% confirm a couple of my issues are fixed as some are/were intermittent, but the buttons on the interface seem that weren’t working before seem to be now.

Thank you all for your help! I have no idea how I otherwise would have known to downgrade the firmware, and to which specific version. And whether it was part of the fix or not, I appreciate the push to clean up the wires.

Next step is to start cutting the strut plates! I’m optimistic that will work now. Thanks again to all three of you for taking the time to help me out!

5 Likes

3.7.17 or 3.7.18? Shoot I have been flashing everything 3.7.18!?!

Pretty sure they said 3.7.18 is where the problems started

1 Like

Yea, 3.7.17 is what @CesarH found. It’s unfortunate as there are some other good fixes in 3.7.18. I also think that in 3.7.17, the homing amps need to be set in the config.yaml since the default is low.

2 Likes

Stability means 3.7.17 for now. With the other caveats above.

1 Like