Re: FIO enter dead-loop to look for new file

From: Jens Axboe <jens.axboe_at_oracle.com>
Date: Tue, 26 Feb 2008 05:10:41 +0100

On Tue, Feb 26 2008, Zhang, Yanmin wrote:
> On Mon, 2008-02-25 at 15:36 +0100, Jens Axboe wrote:
> > On Mon, Feb 25 2008, Zhang, Yanmin wrote:
> > > On Mon, 2008-02-25 at 06:28 +0100, Jens Axboe wrote:
> > > > On Mon, Feb 25 2008, Zhang, Yanmin wrote:
> > > > > On Wed, 2008-02-13 at 16:58 +0800, Zhang, Yanmin wrote:
> > > > > > On Tue, 2008-02-05 at 10:06 +0100, Jens Axboe wrote:
> > > > > > > On Tue, Feb 05 2008, Jens Axboe wrote:
> > > > > > > > On Tue, Feb 05 2008, Zhang, Yanmin wrote:
> > > > > > > > > On Mon, 2008-02-04 at 11:01 +0100, Jens Axboe wrote:
> > > > > > > > > > On Mon, Feb 04 2008, Zhang, Yanmin wrote:
> > > > > > > > > > > On Mon, 2008-02-04 at 17:03 +0800, Zhang, Yanmin wrote:
> > > > > > > > > > > > When I used below job file to test, it hangs. I used gdb to check it and found
> > > > > > > > > > > > thread_main keeps calling clear_io_state over and over again. Every sub-process
> > > > > > > > > > > > has one file, but it doesn't finish its task after it finishs the file, so it
> > > > > > > > > > > > calls do_io again and again although it has no more file.
> > > > > > > > > > > >
> > > > > > > > > > > > If change bsrange=4k-4k, it does work. if it's 2k-4k, it also doesn't work.
> > > > > > > > > > > If I use bs=2k to replace bsrange, it looks like it does work although
> > > > > > > > > > > my testing is still running.
> > > > > > > > > >
> > > > > > > > > > Can you try the current version, I fixed some bugs in this area on
> > > > > > > > > > friday? Either use git to download it, or just use
> > > > > > > > > >
> > > > > > > > > > http://brick.kernel.dk/snaps/fio-git-latest.tar.gz
> > > > > > > > > I tried it. with bsrange=2k-4k, it doesn't hang. However, there is
> > > > > > > > > anthoer issue. I used 9 disks and every disk has a 1GB file. Every 2
> > > > > > > > > threads do I/O on one file, so there are 18 threads and 9 groups. With
> > > > > > > > > the new fio-git, the status shows there are just 4 threads working on
> > > > > > > > > I/O. The result also showed 5 groups has no result.
> > > > > This issue appears again in the latest tarball of Feb. 21st.
> > > >
> > > > That's odd, and you state that the fix is missing. Perhaps my git tar
> > > > ball script is broken. What does git log say in the directory of the
> > > > downloaded tar ball?
> > >
> > > 1) When I untie the tarball fio-git-latest.tar.gz, tar reported:
> > > fio/HOWTO
> > >
> > > gzip: stdin: decompression OK, trailing garbage ignored
> > > fio/blktrace_api.h
> > > tar: Child returned status 2
> > > tar: Error exit delayed from previous errors
> > >
> > > But I still could compile and use it.
> > >
> > > 2) git log output of last items. It looks like no the fix.
> >
> > Please re-download the tar ball. I checked it last night when replying
> > and it was fine, you must have a corrupted copy perhaps? I'd definitely
> > not trust anything with a gzip integrity warning :-)
> I re-downloaded it from another machine and it does work now. It looks like
> something is wrong about the IT proxy.
>
> Suggestion: How about to create a MD5SUM file with the source tarball?

Good suggestion, I'll update the scripts!

-- 
Jens Axboe
Received on Tue Feb 26 2008 - 05:10:41 CET

This archive was generated by hypermail 2.2.0 : Tue Feb 26 2008 - 05:30:03 CET