[duplicate] assigning var to local let does not properly copy in default gc

krux02 commented 4 years ago

Function echo outputs the wrong string.

Example

var myGlobal = @[1,2,3,4,5]

proc foo() =
  let myGlobalCopy = myGlobal
  myGlobal[0] = 123
  echo myGlobalCopy

foo()

Current Output

@[123, 2, 3, 4, 5]

Expected Output

@[1, 2, 3, 4, 5]

Additional Information

Variants that work expectedly:

use an array istead of a seq

var myGlobal = [1,2,3,4,5]
proc foo() =
  let myGlobalCopy = myGlobal
  myGlobal[0] = 123
  echo myGlobalCopy
foo()

use a var instead of a let:

var myGlobal = @[1,2,3,4,5]
proc foo() =
  var myGlobalCopy = myGlobal
  myGlobal[0] = 123
  echo myGlobalCopy
foo()

wrap the seq in a tuple

var myGlobal = (@[1,2,3,4,5],)
proc foo() =
  let myGlobalCopy = myGlobal
  myGlobal[0][0] = 123
  echo myGlobalCopy
foo()

This issue caused a regression in my PR here: https://github.com/nim-lang/Nim/pull/13687#issuecomment-602307527 (regression 1)

$ nim -v
Nim Compiler Version 1.1.1 [Linux: amd64]
Compiled at 2020-03-26
Copyright (c) 2006-2019 by Andreas Rumpf

git hash: 46c827be6a959be3d3bb840d1582bac8fc55bda6
active boot switches: -d:release -d:danger

rayman22201 commented 4 years ago

This does seem like a real issue. Araq has said previously stated that the spec states that assigning to let from a global should copy (or at least have copy semantics): https://github.com/nim-lang/Nim/issues/13140#issuecomment-574284278

Araq commented 4 years ago

--gc:arc fixes that.
The spec changed, originally let was introduced to mimic parameter passing semantics that also don't copy.

krux02 commented 4 years ago

Are you saying that this bug doesn't need to be fixed, because there exists a compiler flag (with btw many other problems) where this bug does not appear?

rayman22201 commented 4 years ago

1. `--gc:arc` fixes that.

2. The spec changed, originally `let` was introduced to mimic parameter passing semantics that also don't copy.

1.) Are we not supporting the other gcs now?

2.) What did the spec change into? Because this behavior is unintuitive and not obvious.

Araq commented 4 years ago

I'm saying the "bug" has a history, is well known and now with move semantics we can finally fix it, even though the solution is still incomplete without @timotheecour 's .byaddr.

What did the spec change into? Because this behavior is unintuitive and not obvious.

It needs to cause a copy but doesn't and my very own code relies on the current performance implications.

krux02 commented 4 years ago

my very own code relies on the current performance implications.

Then fix the code.

timotheecour commented 4 years ago

how about following:

introduce --legacy:letDoesNotCopy
when semcheck detects a let statement that should copy but doesn't under current semantics up to today:
- if letDoesNotCopy is provided, issue a warning warnLetDoesNotCopy (and maybe suggest to simply add {.byaddr.})
- else, use --gc:arc semantics, ie, make a copy
make --legacy:letDoesNotCopy localizable just like {.experimental:forLoopMacros.} to allow gradual transition instead of forcing users to fix all their code at once
warning can be suppressed as usual via --warning:warnLetDoesNotCopy:off
[EDIT] we could also introduce {.bycopy.} to make gradual transition easier, eg: let a {.bycopy.} = expr ; {.bycopy.} would force copy semantics. The pragma is ignored unless we're inside a {.legacy:letDoesNotCopy.} section. I expect this can be used in a few key performance places.

Note: I'm going to use the same thing for https://github.com/nim-lang/Nim/pull/13792; {.legacy:implicitCstringConv.}

Araq commented 4 years ago

Then fix the code.

So predictable. What about code I don't have control over? It's a breaking change, that's all I'm saying. And we collect the new, mildly breaking stuff under --gc:arc already.

timotheecour commented 4 years ago

I think this issue is essentially the same as https://github.com/nim-lang/Nim/issues/13140 (https://github.com/nim-lang/Nim/issues/13431 is somewhat related too); but it's good to have the examples shown in this issue

--gc:arc fixes that

in top example but not in following example (adapted from https://github.com/nim-lang/Nim/issues/13140): doAssert fails with or without --gc:arc here:

when true:
  type Bar = ref object
    foo: string
  proc main(a: Bar)=
    let b = a.foo
    doAssert cast[int](a.foo[0].unsafeAddr) != cast[int](b[0].unsafeAddr)
  let a = Bar(foo: "abc")
  main(a)

krux02 commented 4 years ago

@timotheecour I tried your example. After all It uses unsafeAddr and I don't agree that this is a bug yet. I tried to tried to create an unexpected side effect, and I could not get one.

type Bar = ref object
  foo: string

proc main(a: Bar) =
  let b = a.foo
  echo a.foo[0].unsafeAddr == b[0].unsafeAddr
  a.foo[0] = '3'
  echo b
  echo a.foo
  echo a.foo[0].unsafeAddr == b[0].unsafeAddr

let a = Bar(foo: "abc")
main(a)

output:

true
abc
3bc
false

This looks to me like copy on write

rayman22201 commented 4 years ago

@Araq Thanks for providing more context. After your response, I favor a deprecation path approach. Classic case of Hyrum's Law

Varriount commented 4 years ago

I'm saying the "bug" has a history, is well known and now with move semantics we can finally fix it, even though the solution is still incomplete without @timotheecour 's .byaddr.

What did the spec change into? Because this behavior is unintuitive and not obvious.

It needs to cause a copy but doesn't and my very own code relies on the current performance implications.

What about shallowCopy? Or simply taking a pointer? Both of these mechanisms already exist for just these kinds of optimizations.

Then fix the code.

So predictable. What about code I don't have control over? It's a breaking change, that's all I'm saying. And we collect the new, mildly breaking stuff under --gc:arc already.

If this is the case, then I don't see how byaddr (or any other additions that require explicit code changes) will be the solution then. If our restriction here is "we need something that doesn't require code changes" , then the best solution would be to add to/improve the compiler mechanisms that do memory copy optimization (sink, etc).

Aside from this, I don't consider performance regression a breaking change, especially when it's caused by fixing behavior that's dangerous/unspecified.

Nim already gives you plenty of tools to do dangerous things when you need to. Up until now I thought that the philosophy was "provide safe/sane behavior by default, and then give the users explicit tools to tinker with the inner-workings when they need to". Has this changed?

timotheecour commented 4 years ago

What about shallowCopy

doesn't help with most types, see example1 and example2 below.

Or simply taking a pointer?

that's more cumbersome to use (you have do deref at each location) and inherently less safe (your pointer can escape, whereas the byaddr is deref-on-use).

import std/unittest
import std/pragmas

type Foo = object
  bar: array[10, int]
  age: int

var count = 0
proc getAgeSideEffect(a: var Foo): var int =
  count.inc
  a.age

proc fun[T](a: var T) =
  block:
    # example 1:
    var b {.byaddr.} = a.bar
    doAssert b[0].unsafeAddr == a.bar[0].unsafeAddr

    # example 2:
    var b2 {.byaddr.} = a.getAgeSideEffect
    b2 += 10
    doAssert a.age == 10
    b2 -= 10
    doAssert a.age == 0
    doAssert count == 1

  block:
    var b: type(a.bar)
    b.shallowCopy a.bar
    check b[0].unsafeAddr == a.bar[0].unsafeAddr # fails

    var b2: int
    b2.shallowCopy a.getAgeSideEffect
    b2+=13
    check a.age == 13 # fails

var a: Foo
a.bar[0] = 3
fun(a)

/Users/timothee/git_clone/nim/timn/tests/nim/all/t10471.nim(38, 26): Check failed: b[0].unsafeAddr == a.bar[0].unsafeAddr
/Users/timothee/git_clone/nim/timn/tests/nim/all/t10471.nim(43, 16): Check failed: a.age == 13
a.age was 0

Aside from this, I don't consider performance regression a breaking change, especially when it's caused by fixing behavior that's dangerous/unspecified.

performance regressions are a breaking change (eg: your server used to handle the load and now it doesn't). Furthermore it's not just about performance, it's about ref semantics: whether you are copying or taking a reference affects code semantics.

That's why I'm considering this, at least for transition period:

let a1 {.byaddr.} = expr # reference semantics
let a2 {.bycopy.} = expr # copy semantics
let a3 = expr
  # always copy for pointer types (ref/ptr/pointer/cstring), ordinal, float
  # warning (or even error) for other types (seq, string, object, set), since behavior differs bw --gc:arc vs other

Araq commented 4 years ago

Up until now I thought that the philosophy was "provide safe/sane behavior by default, and then give the users explicit tools to tinker with the inner-workings when they need to". Has this changed?

No, on the contrary, we are about to create a copy by default because that's safer to do. We don't do it as a "bugfix" though because O(1) vs O(n) behaviour is not a "bugfix".

rayman22201 commented 4 years ago

@Varriount

Aside from this, I don't consider performance regression a breaking change, especially when it's caused by fixing behavior that's dangerous/unspecified.

I have to agree with Timothee here. Performance can be a bug. It is important. What is special about this case is that it is performance vs. correctness. It's not just a performance regression. It's a semantic change. On this part, I think you and I agree.

In general, I agree, correctness is more important. But, in the real world, we can't just suddenly change the behavior. You must provide a deprecation path.

As a person who works with corporate level software that depends on performance characteristics for their day job, I can tell you, people get really mad when you suddenly change the performance without warning (even in the name of correctness.) And everyone gets mad when semantics they rely on suddenly change (even if those semantics were incorrect. See my above comment about Hyrum's law.)

Nim already gives you plenty of tools to do dangerous things when you need to. Up until now I thought that the philosophy was "provide safe/sane behavior by default, and then give the users explicit tools to tinker with the inner-workings when they need to".

I'm a bit naive when it comes to the {.byaddr.} and {.bycopy.} pragmas... I know Krux doesn't like it. Part of this is because adding more features is a maintenance burden in general. I understand that.

But as far as the feature itself, I don't know enough to have an opinion on the semantic pros and cons of those features. I have to trust araq and smarter people (until I have time to research it).

Either way, this feature seems exactly aligned with the idea of "provide safe/sane behavior by default, and then give the users explicit tools to tinker with the inner-workings when they need to".

You don't have to use {.byaddr.}. It's an advanced feature that you use if you need it. That is exactly in line with Nim's model.

If our restriction here is "we need something that doesn't require code changes" , then the best solution would be to add to/improve the compiler mechanisms that do memory copy optimization (sink, etc).

I don't think this feature blocks the compiler from making internal optimizations in any way for most normal code... It simply provides explicit control for those who want it. (again, I may be wrong here. I haven't fully understood the feature yet.)

IIUC, for the immediate future, araq is fixing the default, and timothee's flag idea is a verbose but reasonable solution for deprecation.

timotheecour commented 3 years ago

this issue is mis-diagnosed; it has nothing to do with global var since it can be reproduced with assigning a local var to a local let:

when true:
  template fn =
    var a = @[1,2,3] # with array instead, there'd be no difference bw default gc and gc:arc
    let b = a
    a[0] += 10
    echo (b: b, isCopy: b[0].unsafeAddr != a[0].unsafeAddr)
  fn()
  proc main = fn()
  main()

default gc: (b: @[1, 2, 3], isCopy: true) (b: @[11, 2, 3], isCopy: false)

gc:arc (b: @[1, 2, 3], isCopy: true) (b: @[1, 2, 3], isCopy: true)

instead this looks like a duplicate of https://github.com/nim-lang/Nim/issues/2314 so we could IMO close this as duplicate but it has useful discussions above which should be referenced in https://github.com/nim-lang/Nim/issues/2314

ringabout commented 2 years ago

Duplicate of https://github.com/nim-lang/Nim/issues/2314

nim-lang / Nim