Fix semaphores in IDF & std::string assert #2728

h2zero · 2019-04-30T23:23:32Z

Fixes the problem of giving a mutex from a callback with the latest IDF. Also addresses an occasional assert that happens when the btc_task callback gives the semaphore and causes an assert due to both cores potentially writing m_owner concurrently.

projectgus · 2019-05-01T00:27:03Z

libraries/BLE/src/FreeRTOS.cpp

@@ -62,22 +62,21 @@ uint32_t FreeRTOS::getTimeSinceStart() {
 uint32_t FreeRTOS::Semaphore::wait(std::string owner) {
 	log_v(">> wait: Semaphore waiting: %s for %s", toString().c_str(), owner.c_str());

+	m_owner = owner;


Setting the owner before we take the semaphore seems like maybe it can lead to incorrect behaviour. Think about this sequence of events:

Task A calls s->wait("task_a"). m_owner is set to "task_a", and then waiting for the semaphore succeeds because noone was holding it.

Task B calls s->wait("task_b"). m_owner is set to "task_b" but then Task B is blocked in xSemaphoreWait() because Task A is still holding the semaphore.

Now we have a situation where Task A is holding the semaphore, but the owner field indicates Task B is holding it.

Good point, hadn't really thought about it as the way the BLE library currently works (on the client side anyway) there is multiple semaphores used and only one task actually takes each of them. Also, as this was a mutex before, the semaphore is usually taken already before we call the wait function to block the task and wait for the callback to give the semaphore back, at which time it sets m_owner to "<N/A>" anyway.

The reason for this change though is I've encountered many instances of on assert error when giving the semaphore from the callback as both cores sometimes try to write to m_owner at the same time, one in the give() on core 0 and this one after we've unblocked in wait() on core 1.

Anyway, maybe a better solution would be to pin the callback task to core 1, or maybe just remove the setting of m_owner in wait() all together? I'm open to suggestions :).

What is the point of this m_owner anyway? Logging?

Yes as far as I can tell it’s only used for logging, but I suppose someone could call the toString() method on the semaphore to check ownership.

I’m tempted just to remove it entirely for my use, but others might find it useful.

@h2zero

Right, sorry I should have read the full code - I missed that wait() isn't "take", it's "take then give". Not quite what I expected...

The reason for this change though is I've encountered many instances of on assert error when giving the semaphore from the callback as both cores sometimes try to write to m_owner at the same time, one in the give() on core 0 and this one after we've unblocked in wait() on core 1.

If this is the problem then I think the best solution is to change all the m_owner assignments so they only happen when holding the semaphore, so the semaphore protects against any race condition. I think this means:

In ::give(), move setting of m_owner up to before the semaphore is given so it's assigned while the caller still holds the semaphore.

In ::wait(), don't set m_owner at all. As soon as the caller in wait() gets the semaphore it gives it back, and giving a semaphore never blocks, so if you did set m_owner then it gets immediately un-set afterwards which seems like a waste of CPU cycles.

@projectgus

Yeah that’s along the line of my thinking as well. Sorry I changed the PR yesterday assuming you wanted me to revert the m_owner move and I didn’t see your reply here first. I’ll make the changes as you’ve mentioned and fix the PR again lol.

@projectgus

Made the changes, tested with my test code that would cause the assert within seconds and has been running over an hour now. Logging works fine as well.

Awesome! :)

Update esp32-hal-rmt.c

Revert previous revert commit and move setting of m_owner in ::give to before giving the semaphore to prevent race condition possibility.

h2zero mentioned this pull request Apr 30, 2019

BLE isnt working properly while using Arduino as esp-idf component #2723

Closed

projectgus reviewed May 1, 2019

View reviewed changes

h2zero added 3 commits May 1, 2019 12:10

Merge pull request #1 from espressif/master

2d5e739

Update esp32-hal-rmt.c

Restored m_owner position in wait() as requested

9919451

Reapply assert fix and move setting m_owner in ::give()

8fd7f05

Revert previous revert commit and move setting of m_owner in ::give to before giving the semaphore to prevent race condition possibility.

me-no-dev merged commit 43bf393 into espressif:master May 11, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix semaphores in IDF & std::string assert #2728

Fix semaphores in IDF & std::string assert #2728

h2zero commented Apr 30, 2019

projectgus May 1, 2019 •

edited

Loading

h2zero May 1, 2019

me-no-dev May 1, 2019

h2zero May 1, 2019

projectgus May 1, 2019 •

edited

Loading

h2zero May 2, 2019 •

edited

Loading

h2zero May 2, 2019

projectgus May 3, 2019

Fix semaphores in IDF & std::string assert #2728

Fix semaphores in IDF & std::string assert #2728

Conversation

h2zero commented Apr 30, 2019

projectgus May 1, 2019 • edited Loading

Choose a reason for hiding this comment

h2zero May 1, 2019

Choose a reason for hiding this comment

me-no-dev May 1, 2019

Choose a reason for hiding this comment

h2zero May 1, 2019

Choose a reason for hiding this comment

projectgus May 1, 2019 • edited Loading

Choose a reason for hiding this comment

h2zero May 2, 2019 • edited Loading

Choose a reason for hiding this comment

h2zero May 2, 2019

Choose a reason for hiding this comment

projectgus May 3, 2019

Choose a reason for hiding this comment

projectgus May 1, 2019 •

edited

Loading

projectgus May 1, 2019 •

edited

Loading

h2zero May 2, 2019 •

edited

Loading