Issue 4978047: code review 4978047: ld: Fixes issue 1899 ("cannot create 8.out.exe")

Files changed (+13 lines, -3 lines):
	M	doc/progs/run	3 chunks	+8 lines, -3 lines	1 comment
	M	src/cmd/ld/lib.c	1 chunk	+5 lines, -0 lines	0 comments

Messages

Total messages: 66

Hello golang-dev@googlegroups.com (cc: alex.brainman@gmail.com, golang-dev@googlegroups.com), I'd like you to review this change to https://go.googlecode.com/hg/

13 years, 10 months ago (2011-08-31 02:57:00 UTC) #1

rsc

This fixes two links in a row trying to use the same output file name. ...

13 years, 10 months ago (2011-08-31 10:52:18 UTC) #2

Three in a row work for me. On 2011/08/31 10:52:18, rsc wrote: > This fixes ...

13 years, 10 months ago (2011-08-31 21:14:57 UTC) #3

rsc

Okay, sounds good. Please remove the ifdef and use just "~". We might as well ...

13 years, 10 months ago (2011-08-31 21:16:18 UTC) #4

Hello golang-dev@googlegroups.com, rsc@golang.org (cc: alex.brainman@gmail.com, golang-dev@googlegroups.com), Please take another look.

13 years, 10 months ago (2011-08-31 21:40:41 UTC) #6

rsc

With the changes below it would look fine except that the rename has the arguments ...

13 years, 10 months ago (2011-08-31 21:46:58 UTC) #7

brainman

Please, change CL description to something like: " ld: rename + remove to work around ...

13 years, 10 months ago (2011-08-31 23:45:01 UTC) #8

On 2011/08/31 21:46:58, rsc wrote: > They were backward in your original CL too. Oops. ...

13 years, 10 months ago (2011-09-01 04:26:40 UTC) #9

rsc

yay portability http://codereview.appspot.com/4978047/diff/17003/src/cmd/ld/lib.c File src/cmd/ld/lib.c (left): http://codereview.appspot.com/4978047/diff/17003/src/cmd/ld/lib.c#oldcode73 src/cmd/ld/lib.c:73: remove(outfile); Skipping the remove is not okay ...

13 years, 10 months ago (2011-09-01 17:55:34 UTC) #10

13 years, 10 months ago (2011-09-01 23:54:47 UTC) #11

PTAL

http://codereview.appspot.com/4978047/diff/17003/src/cmd/ld/lib.c
File src/cmd/ld/lib.c (left):

http://codereview.appspot.com/4978047/diff/17003/src/cmd/ld/lib.c#oldcode73
src/cmd/ld/lib.c:73: remove(outfile);
On 2011/09/01 17:55:35, rsc wrote:
Hereinafter, 'Windows' refers to behavior common to all current Windows versions
(2000 through Windows 7), and 'Windows_7_' refers to behavior specific to Windows 7.

> Skipping the remove is not okay either.
> It is working around a Unix file system behavior
> where a binary that is running is not writable,
> so that if you run 6l x.6, 6.out, and then 6l x.6
> again, if the 6.out hasn't finished, the second 6l
> will fail to write a new file.  The remove is supposed
> to take care of that.

Windows does not allow writing to a running file either.
Windows also does not allow removing a running file.
But Windows does allow renaming a running file (within the same filesystem, at
least).

So this issue of rewriting a running file is almost the same on Windows.

The solutions differ.
Adding remove() before create() does not solve it.
Adding rename() does.
After rename(), the temporary file is the one whose executable code is mmap'ed
into memory.
The temporary file therefore becomes the file that is protected from removal.
So in the chain "rename(out, tmp), remove(tmp), create(out)", remove() will fail
if the file is running.
And the temporary file will remain.

It also means that "%s~" is not enough.
Imagine 8.out.exe does not terminate for a long time: we create a new 8.out.exe
and run it, then create another new 8.out.exe and run it, and so on.
Each running 8.out.exe with unique content must have its own file on disk with
a unique name.
There will be "8.out.exe~1", "8.out.exe~2", etc.
Who will take care of deleting them? (Well, on Windows it is possible to add a
filename to a list of files that will be deleted on the next reboot, when they
are definitely not running. But that looks too tricky to do in a program like
the linker.)

Must the linker really not fail when writing to a running file?
I think the linker must fail if the output file is running.
If non-failing behavior is really needed, the user should run 'rm' beforehand in
their shell or makefile scripts (and see 'rm' fail).

For the Unix case you wrote about, it could be worth handling a create() failure
by doing remove() and then create() again.
But not remove() before a create() that may succeed.
On Windows_7_, remove() can break the subsequent create().

> Maybe Windows is keeping the outfile around
> (delaying the remove) for the same reason?  
> Do you know of any documentation explaining
> why a Windows remove would be delayed?

What I said above about Windows comes from my pre-Windows_7_ experience.
The delayed remove() looks like a new feature.
It looks less like an API change and more like another process (Microsoft
antivirus?) keeping the file's handle open.

"The DeleteFile function fails if an application attempts to delete a file that
is open for normal I/O or as a memory-mapped file.
The DeleteFile function marks a file for deletion on close. Therefore, the file
deletion does not occur until the last handle to the file is closed. Subsequent
calls to CreateFile to open the file fail with ERROR_ACCESS_DENIED."
http://msdn.microsoft.com/en-us/library/aa363915(v=vs.85).aspx

> It sounds like maybe the right code here is
> 
> // Unix doesn't like it when we write to a running
> // (or, sometimes, recently run) binary, so remove
> // the output file before writing it.  Windows postpones
> // a remove of a running (or, sometimes, recently run)
> // binary, so rename it before removing it.
Not quite correct: Windows allows neither writing to nor removing a
running file.
The problem is that on Windows_7_, calling remove(name) triggers some odd machinery
that puts the filename into a strange state for a while.
During this period create(name) fails, and a second remove(name) would also
fail with an ACCESS_DENIED error instead of the expected FILE_NOT_FOUND.
The period can be quite long (10 s or more).

> p = smprint("%s~", outfile);
> rename(outfile, p);
> remove(p);
> free(p);
> 

Maybe:
	cout = create(outfile, 1, 0775);
	if(cout < 0) {
+		remove(outfile);
+		cout = create(outfile, 1, 0775);
+		if(cout < 0) {
			diag("cannot create %s", outfile);
			errorexit();
+		}
	}

What do you think, does it solve the Unix issue?

rsc

I don't think you have to check the return values. Just do a rename + ...

13 years, 10 months ago (2011-09-01 23:59:48 UTC) #12

On 2011/09/01 23:59:48, rsc wrote: Do you mean? cout = create(outfile, 1, 0775); if(cout < ...

13 years, 10 months ago (2011-09-02 00:15:59 UTC) #13

Hello rsc@golang.org, alex.brainman@gmail.com (cc: golang-dev@googlegroups.com), Please take another look.

13 years, 10 months ago (2011-09-02 00:33:40 UTC) #14

bsiegert

On Thu, Sep 1, 2011 at 19:55, <rsc@golang.org> wrote: > Maybe Windows is keeping the ...

13 years, 10 months ago (2011-09-02 08:24:06 UTC) #15

rsc

http://codereview.appspot.com/4978047/diff/11006/src/cmd/ld/lib.c File src/cmd/ld/lib.c (right): http://codereview.appspot.com/4978047/diff/11006/src/cmd/ld/lib.c#newcode75 src/cmd/ld/lib.c:75: // It is essencial to try create() first No, ...

13 years, 10 months ago (2011-09-02 17:21:22 UTC) #16

On 2011/09/02 17:21:22, rsc wrote: > http://codereview.appspot.com/4978047/diff/11006/src/cmd/ld/lib.c > File src/cmd/ld/lib.c (right): > > http://codereview.appspot.com/4978047/diff/11006/src/cmd/ld/lib.c#newcode75 > ...

13 years, 10 months ago (2011-09-05 16:25:04 UTC) #17

rsc

Okay, then let's just go back to replacing

	remove(outfile);

with

	#ifndef _WIN32
	remove(outfile);
	#endif

13 years, 10 months ago (2011-09-05 17:07:53 UTC) #18

brainman

http://codereview.appspot.com/4978047/diff/11006/src/cmd/ld/lib.c File src/cmd/ld/lib.c (right): http://codereview.appspot.com/4978047/diff/11006/src/cmd/ld/lib.c#newcode80 src/cmd/ld/lib.c:80: if(0 != remove(outfile)) { I still get errors with ...

13 years, 10 months ago (2011-09-06 02:25:05 UTC) #19

brainman

On 2011/09/05 16:25:04, jp wrote: > > Third ld's run will fail on rename(), then ...

13 years, 10 months ago (2011-09-06 02:30:48 UTC) #20

brainman

On 2011/09/05 17:07:53, rsc wrote: > > #ifndef _WIN32 > remove(outfile); > #endif This fails ...

13 years, 10 months ago (2011-09-06 02:31:32 UTC) #21

On 2011/09/06 02:30:48, brainman wrote: > On 2011/09/05 16:25:04, jp wrote: > > > > ...

13 years, 10 months ago (2011-09-06 04:24:55 UTC) #22

brainman

On 2011/09/06 04:24:55, jp wrote: > > PTAL, tha last patch is Russ' one (which ...

13 years, 10 months ago (2011-09-06 07:09:47 UTC) #23

rsc

I am confused. I thought we had finished enumerating the reasons that rename+remove was not ...

13 years, 10 months ago (2011-09-07 17:52:19 UTC) #24

brainman

On 2011/09/07 17:52:19, rsc wrote: > > #ifndef _WIN32 > remove(outfile); > #endif > I ...

13 years, 9 months ago (2011-09-07 23:12:52 UTC) #25

rsc

On Wed, Sep 7, 2011 at 19:12, <alex.brainman@gmail.com> wrote: > On 2011/09/07 17:52:19, rsc wrote: ...

13 years, 9 months ago (2011-09-08 00:53:40 UTC) #26

brainman

On 2011/09/08 00:53:40, rsc wrote: > ... What is the problem that you are seeing? ...

13 years, 9 months ago (2011-09-08 03:05:26 UTC) #27

On 2011/09/08 03:05:26, brainman wrote: > - remove(outfile); > +// remove(outfile); > $ for i ...

13 years, 9 months ago (2011-09-08 07:44:49 UTC) #28

Reproduced with an external USB drive. It seems to be a bug of MinGW's bash.exe ...

13 years, 9 months ago (2011-09-08 09:50:11 UTC) #29

rsc

http://codereview.appspot.com/4978047/diff/15003/doc/progs/run File doc/progs/run (right): http://codereview.appspot.com/4978047/diff/15003/doc/progs/run#newcode45 doc/progs/run:45: TMPFILE="/tmp/gotest3-$$-$USER" # Write to temporary file to avoid mingw ...

13 years, 9 months ago (2011-09-12 17:00:24 UTC) #30

13 years, 9 months ago (2011-09-12 18:10:05 UTC) #31

rsc

http://codereview.appspot.com/4978047/diff/6004/doc/progs/run File doc/progs/run (right): http://codereview.appspot.com/4978047/diff/6004/doc/progs/run#newcode45 doc/progs/run:45: # Write to temporary file to avoid mingw bash ...

13 years, 9 months ago (2011-09-12 18:29:47 UTC) #32

13 years, 9 months ago (2011-09-12 19:21:57 UTC) #33

brainman

On 2011/09/14 15:19:45, rsc wrote: > LGTM > Can I have more time to test ...

13 years, 9 months ago (2011-09-15 07:38:12 UTC) #35

rsc

On Thu, Sep 15, 2011 at 03:38, <alex.brainman@gmail.com> wrote: > Can I have more time ...

13 years, 9 months ago (2011-09-15 15:21:39 UTC) #36

brainman

13 years, 9 months ago (2011-09-16 02:27:50 UTC) #37

Thank you for sticking with this.

http://codereview.appspot.com/4978047/diff/11011/doc/progs/run
File doc/progs/run (right):

http://codereview.appspot.com/4978047/diff/11011/doc/progs/run#newcode46
doc/progs/run:46: TMPFILE="/tmp/gotest3-$$-$USER"
I didn't look closely at this. Considering that the main problem with 8l is not
resolved (see my other comment), I am not sure whether this change helps or not.

The only thing I have noticed is that sometimes this script will leave
files in /tmp, because of "set -e". If you proceed with this change, perhaps it
is OK to just hard-code the file name. And it does not need to be in /tmp.
Everything in here runs in sequence.

http://codereview.appspot.com/4978047/diff/11011/src/cmd/ld/lib.c
File src/cmd/ld/lib.c (right):

http://codereview.appspot.com/4978047/diff/11011/src/cmd/ld/lib.c#newcode78
src/cmd/ld/lib.c:78: #endif
I wish it would work, but it doesn't. This program:

package main

import (
	"bytes"
	"exec"
	"io"
	"io/ioutil"
	"log"
	"os"
	"strconv"
)

const prog = `
package main
func main() {
println("Hello")
}
`

/*
func runOne(args ...string) {
	cmd := args[0]
	args = args[1:]
	b, err := exec.Command(cmd, args...).CombinedOutput()
	if err != nil {
		log.Fatalf("%s failed: %s: %s\n", cmd, err, b)
	}
}
*/

func runOne(args ...string) string {
	cmd := args[0]
	fullcmd, err := exec.LookPath(cmd)
	if err != nil {
		log.Fatalf("LookPath(%s): %v", cmd, err)
	}

	r, w, err := os.Pipe()
	if err != nil {
		log.Fatalf("Pipe: %v", err)
	}
	attr := &os.ProcAttr{Files: []*os.File{nil, w, w}}
	p, err := os.StartProcess(fullcmd, args, attr)
	if err != nil {
		log.Fatalf("StartProcess: %v", err)
	}
	defer p.Release()
	w.Close()

	var b bytes.Buffer
	io.Copy(&b, r)
	output := b.String()
	msg, err := p.Wait(0)
	if err != nil {
		log.Fatalf("Wait: %v", err)
	}
	if !msg.Exited() || msg.ExitStatus() != 0 {
		log.Fatalf("ExitStatus(%d): %v", msg.ExitStatus(), output)
	}
	return output
}

func run() {
	// create source file
	err := ioutil.WriteFile("hello.go", []byte(prog), 0666)
	if err != nil {
		log.Fatal(err)
	}
	defer os.Remove("hello.go")
	// compile source file
	runOne("8g", "-o", "hello.8", "hello.go")
	defer os.Remove("hello.8")
	// link executable
	runOne("8l", "-o", "hello.exe", "hello.8")
//	defer os.Remove("hello.exe")
	// run executable
	runOne("./hello.exe")
}

func main() {
	if len(os.Args) != 2 {
		log.Fatal("Invalid number of args")
	}
	n, e := strconv.Atoi(os.Args[1])
	if e != nil {
		log.Fatalf("Must be a number (%s): %s\n", os.Args[1], e)
	}
	for i := 0; i < n; i++ {
		run()
	}
}

fails if I run it like

test.exe 10000

I wrote this, so we can't blame mingw or anything.

Your version improves things a bit (it doesn't fail as often as the original), but
it is not 100% reliable. Your previous attempts (where you were moving the file
before deletion) are not 100% reliable either.

I tend to lean towards submitting this change - at least it improves on our
current situation. Maybe make a comment to say it is not 100%. What do you
think?

On 2011/09/16 02:27:50, brainman wrote: > http://codereview.appspot.com/4978047/diff/11011/doc/progs/run > File doc/progs/run (right): > > http://codereview.appspot.com/4978047/diff/11011/doc/progs/run#newcode46 > ...

13 years, 9 months ago (2011-09-16 02:38:29 UTC) #38

brainman

On 2011/09/16 02:38:29, jp wrote: > 1. does test/run fail on your machine ? I ...

13 years, 9 months ago (2011-09-16 02:51:56 UTC) #39

On 2011/09/16 02:51:56, brainman wrote: > On 2011/09/16 02:38:29, jp wrote: > > 1. does ...

13 years, 9 months ago (2011-09-16 03:01:12 UTC) #40

brainman

I see no way to improve on your change to ld. Please fix problem with ...

13 years, 9 months ago (2011-09-19 03:49:15 UTC) #41

On 2011/09/19 03:49:15, brainman wrote: PTAL > I see no way to improve on your ...

13 years, 9 months ago (2011-09-23 23:56:05 UTC) #42

brainman

On 2011/09/23 23:56:05, jp wrote: > On 2011/09/19 03:49:15, brainman wrote: > PTAL Please, see ...

13 years, 9 months ago (2011-09-24 00:32:10 UTC) #43

hector

On Sep 24, 2:18 am, jp wrote: > > > I wonder what happens if ...

13 years, 9 months ago (2011-09-24 12:18:57 UTC) #44

brainman

On 2011/09/24 12:18:57, hector wrote: > > I think we need to clarify what we ...

13 years, 9 months ago (2011-09-24 13:08:13 UTC) #45

hector

On 2011/09/24 13:08:13, brainman wrote: > On 2011/09/24 12:18:57, hector wrote: > > > > ...

13 years, 9 months ago (2011-09-24 14:26:03 UTC) #46

> I think the only sane thing to do now is to revert the change ...

13 years, 9 months ago (2011-09-24 19:12:38 UTC) #47

brainman

It seems to me, we have wasted enough time with this. As I said earlier, ...

13 years, 9 months ago (2011-10-03 06:39:48 UTC) #48

brainman

Tried to submit it, but build fails now: doc/progs/run fails, because "testit helloworld3 ..." fails ...

13 years, 9 months ago (2011-10-04 05:32:40 UTC) #50

rsc

On Tue, Oct 4, 2011 at 01:32, <alex.brainman@gmail.com> wrote: > Tried to submit it, but ...

13 years, 9 months ago (2011-10-05 15:33:46 UTC) #51

brainman

On 2011/10/05 15:33:46, rsc wrote: > On Tue, Oct 4, 2011 at 01:32, <mailto:alex.brainman@gmail.com> wrote: ...

13 years, 9 months ago (2011-10-06 00:21:54 UTC) #52

brainman

On 2011/10/06 17:38:03, rsc wrote: > > Put the 8.out command in ( ) I ...

13 years, 9 months ago (2011-10-06 23:10:36 UTC) #54

rsc

On Thu, Oct 6, 2011 at 19:10, <alex.brainman@gmail.com> wrote: > I don't think we want ...

13 years, 9 months ago (2011-10-06 23:14:37 UTC) #55

brainman

On 2011/10/06 23:14:37, rsc wrote: > > Add || true to the end of the ...

13 years, 9 months ago (2011-10-06 23:17:05 UTC) #56

I did not understand which change do you want to be done. On 2011/10/06 23:17:05, ...

13 years, 9 months ago (2011-10-07 08:56:39 UTC) #57

brainman

http://codereview.appspot.com/4978047/diff/41001/doc/progs/run File doc/progs/run (right): http://codereview.appspot.com/4978047/diff/41001/doc/progs/run#newcode60 doc/progs/run:60: ./$O.out | $2 2>&1 >"$TMPFILE" I think you could ...

13 years, 9 months ago (2011-10-07 11:05:04 UTC) #58

bradfitz

Is this still unresolved? I'm hitting this problem on windows-386 on Windows 7. Why are ...

13 years, 8 months ago (2011-10-14 17:37:17 UTC) #59

hector

The buildbots are slower than our PCs? If jp would move this forward, we can ...

13 years, 8 months ago (2011-10-14 17:40:50 UTC) #60

Hello rsc@golang.org, alex.brainman@gmail.com, bsiegert@gmail.com, hectorchu@gmail.com, bradfitz@golang.org (cc: golang-dev@googlegroups.com), Please take another look.

13 years, 8 months ago (2011-10-14 19:03:41 UTC) #61

hector

Don't submit yet - I'm looking at this CL now and I don't think it ...

13 years, 8 months ago (2011-10-14 19:24:19 UTC) #63

hector

http://codereview.appspot.com/4978047/diff/67001/doc/progs/run File doc/progs/run (right): http://codereview.appspot.com/4978047/diff/67001/doc/progs/run#newcode50 doc/progs/run:50: ./$O.out $2 2>&1 >"$TMPFILE" You need to add || ...

13 years, 8 months ago (2011-10-14 19:29:55 UTC) #64

hector

Since it's close enough, and I've made the necessary change locally, I'll go ahead and ...

13 years, 8 months ago (2011-10-14 19:35:46 UTC) #65

hector

13 years, 8 months ago (2011-10-14 19:37:32 UTC) #66

*** Submitted as http://code.google.com/p/go/source/detail?r=f650efd9ed8d ***

ld: Fixes issue 1899 ("cannot create 8.out.exe")

http://code.google.com/p/go/issues/detail?id=1899

R=rsc, alex.brainman, bsiegert, hectorchu, bradfitz
CC=golang-dev
http://codereview.appspot.com/4978047

Committer: Hector Chu <hectorchu@gmail.com>
