onBranchElement dominating profiles.

I migrated some code using unordered-containers Strict HashMap. https://github.com/YellowOnion/1brc/blob/14842cd63b6f85892f4b9b315be205fa7c2b1eaf/app/Main.hs#L116

I use one map per-thread and then merge them at the end, this should blow-out my L3 (64MB) and create substantially slower code than using one shared Map.

This implementation takes 12s and uses about 450% CPU load.

Migrating my code to stm-containers, it takes 16s and consumes 950% CPU, and spends 50% of it's time inside onBranchElement.

The core loop looks like this:


stmInsertEntry :: BS.ByteString -> Entry -> HMap -> STM ()
stmInsertEntry !k !v = Map.focus (Focus.insertOrMerge (<>) v) k

calcThread :: (OutChan (Maybe BS.ByteString), Int)
           -> HMap
           -> IO ()
calcThread (oc, id) m = runInBoundThread go
  where go = do
          mbs <- readChan oc
          case mbs of
            Just bs -> do
              let (k, v) = parseEntry bs
                in atomically $ stmInsertEntry k v m
              go
            Nothing -> return ()

Here's a profile:

1brc.prof.txt

nikita-volkov / stm-hamt

onBranchElement dominating profiles. #7