[cdwg] Summary of telecon on July 26, and review of information for the board
Christopher J. Morrone
morrone2 at llnl.gov
Fri Jul 27 12:03:04 PDT 2012
Hi folks,
Here is a summary of the telecon from July 26 from my notes:
Attendees:
Chris Morrone - LLNL
Peter Jones - Intel
Cory Spitz - Cray
Justin Miller - IU
Mark Gary - LLNL
James Simmons - ORNL
John Hammond - TACC
My apologies if I missed anyone.
Some of the topics were:
- Maintenance releases
There was general agreement that we will all target our efforts towards
making 2.4 the next maintenance branch. See note for the board at the
end of this email for more information. Comments should be made by the
end of Monday July 30th (lets say 4pm Pacific).
- Conversation about testing (and how we need more)
- Justin reported that the OpenSFS test cluster at IU is targeting
August 1st to allow PAC contract testing to begin. Rough specs are: 24
compute nodes, 4 OSS nodes, 4 MDS nodes, IB connected, Westmere
processors. They are working on a wiki that will document the hardware.
* CDWG identified as likely group to schedule usage of the cluster.
- Need for unified roadmap discussed. CDWG wants to take that on. We
need to make sure that our roadmap is well communicated to our community
and to our board to avoid us advertising conflicting roadmaps.
And now my summary of the Lustre maintenance release plan as I
understand it. As I mentioned early, please let me know if you think
I've gotten anything wrong as soon as possible. I plan to relay this to
the OpenSFS board by the end of the day on Monday.
Summary of Lustre Maintenance Release Plan
------------------------------------------
Lustre maintenance branches host the releases of lustre that we shall
advertise to the general public as "stable" releases. The goal of a
maintenance branch is to include only bug fixes, to ensure a stable
releases for a significant period of time.
We currently have a development cadence that puts out a "feature"
release of Lustre every 6 months. This is going reasonably well, and we
plan to continue that process. However, we do not have the resources to
ensure that every six month release is entirely bug-free, nor do we have
the resources to add a new branch every six months that will receive
only bug, and be maintained for years. We are not able to reasonably
support and test that many branches in parallel at our current level of
investment, nor would we wish to, as the testing requirements rise
exponentially as the number of supported branches increases.
We have decided that every third feature release, occurring every 18
months, will also be the beginning of a maintenance branch. Tagged
maintenance releases along the maintenance branch will occur on an
as-needed basis, according to the demands of discovered bugs and their
severity. It is likely that tags will occur more frequently early in
the branch's lifetime, and taper off in the later months and years.
We plan to make the initially advertised lifetime of a maintenance
branch three years.
We agreed that Lustre 2.4 will begin the next maintenance branch, which
has a targeted release date of the end of March, 2013.
There was some discussion among folks who intend to use 2.3 in
production that it would be nice to have a "mini-maintenance" branch to
hold them over until 2.4 is released. Note that this branch would not
be supported for a significant period of time, likely only a few month.
This will happen if there is sufficient demand and resources applied,
but to avoid confusion among the broader community we will not be
advertising this release. We will also try to avoid using the term
"maintenance" in association with 2.3 to avoid expectations of long term
support if that branch becomes a reality.
Chris
More information about the cdwg
mailing list