“if any thing can go wrong, it will. 该出错的地方迟早要出错。”
----墨菲定
律
很多人看到这个定律的第一反应的哑然失笑,然后又觉得有点道理。说它是定律因为它具备定律的所有要素:从过去的实践中提炼出来,可以解释现在的事实,能够预测未来。
哥伦比亚号的事故应验了这个定律。它再次告诉我们,客观规律必须遵守。违背客观规律,科学技术和经济实力最强大的美国也同样必然付出惨痛的代价。
哥伦比亚号事故原因是多方面的,包括政府、国会、社会政治环境和NASA的文化、组织和结构以及工程技术本身的原因。很多原因与我们国家的很多事故几乎如出一辙。如为了政治目的赶进度,为了降低成本等不做必须的分析研究和试验;官僚主义,唯我独尊,听不得不同意见;不遵守规范,凭经验拍脑袋做决定等。
但是哥伦比亚号的调查组是完全独立的,几乎毫不隐瞒地指出了政府、国会和NASA本身的所有相关责任,令人敬佩不已。这样才能真正找到事故原因,防止今后类似事故的发生。推动人类的进步。
以下是对哥伦比亚号事故原因的调查和分析结果摘要,为了清楚地说明事故原因,调查组还引用了对挑战者号事故的调查结果。原文摘自哥伦比亚号事故调查报告,本文略去了繁琐复杂的工程技术细节。
“...the Board has considered itself an independent and public institution,accountable to the American public, the White House, Congress, the astronaut corps and their families, and NASA. With the support of these constituents, the Board resolved to broaden the scope of the accident investigation into a far-reaching examination of NASA's operation of the Shuttle fleet. We have explored the impact of NASAʼs organizational history and practices on Shuttle safety, as well as the roles of public expectations and national policy-making.”
“专家组认为自己是独立的公众机构,对美国公众、白宫、国会、宇航员团队及其家属和NASA负责。有这些方面的支持,专家组决心扩大事故的调查范围,深入审查NASA的航天飞机团队运营。我们考察了NASA的关于航天飞机安全的组织历史和实践,以及公众期待和政策制定的影响。”
“After nearly seven months of investigation, the Board has been able to arrive at findings and recommendations aimed at significantly reducing the chances of further accidents. Our aim has been to improve Shuttle safety by multiple means, not just by correcting the specific faults that cost the nation this Orbiter and this crew. With that intent, the Board conducted not only an investigation of what happened to Columbia, but also – to determine the conditions that allowed the accident to occur – a safety evaluation of the entire Space Shuttle Program. Most of the Board's efforts were undertaken in a completely open manner. By necessity, the
safety evaluation was conducted partially out of the public view, since it included frank, off-the-record statements by a substantial number of people connected with the Shuttle Program. ”
“经过近七个月的调查,专家组可以推出一些旨在极大地减少未来发生事故的机会的发现和建议。我们的目的一直是通过多种手段改进航天飞机的安全性,而不仅仅是矫正造成国家损失了这个轨道器和一个机组的特定失误。本着这个目的,专家组不仅进行了关于哥伦比亚号是发生了什么的调查,还审查了导致这个事故发生的条件--一个关于这个航天飞机项目的安全性评估。专家组的主要精力都用在了实行一个完全开放的方式。安全评估没有全部对公众公开是必要的,因为它包括与航天飞机项目有关的重要人员的坦诚的无记录的讲话。”
“In this process, the Board identified a number of pertinent factors, which we have grouped into three distinct categories: 1) physical failures that led directly to Columbia's destruction; 2) underlying weaknesses, revealed in NASA's organization and history, that can pave the way to catastrophic failure; and 3) “other significant observations” made during the course of the investigation, but which may be unrelated to the accident at hand. Left uncorrected, any of these factors could contribute to future Shuttle losses.”
“在这个过程中,专家组确定了几个相关因素,将其分为三个鲜明的范畴:1),直接导致哥伦比亚号解体的物理失效;2),NASA的组织和历史中暴露的滋生大灾难的弱点;3)在调查过程中获得的可能初看与这次事故无关的“其它重大发现”。这些如果不纠正,其中任何一个因素都可能导致未来的航天飞机失事。”
“The Board recognized early on that the accident was probably not an anomalous, random event, but rather likely rooted to some degree in NASA's history and the human space flight program's culture. Accordingly, the Board broadened its mandate at the outset to include an investigation of a wide range of historical and organizational issues, including political and budgetary considerations, compromises, and changing priorities over the life of the Space Shuttle Program. The Board's conviction regarding the importance of these factors strengthened as the investigation progressed, with the result that this report, in its findings, conclusions, and recommendations, places as much weight on these causal factors as on
the more easily understood and corrected physical cause of the accident.”
“专家组早就认识到这次事故可能不是一个不正常的随机事故,而是极可能在某些程度上植根于NASA的历史和载人航天文化。相应地,专家组从一开始就扩大了它的授权范围包括了历史和组织问题的广泛调查,包括政治和预算考虑,折中方案,以及航天飞机项目发展过程中不断改变的要务。专家组的关于这些因素重要性的判定随着调查的进展更加坚定了。这个判定包含这份报告的结果,提出的发现、结论和建议,给予这些偶然因素和更多容易理解和纠正的事故的物理原因同等的注重。”
“The initial Shuttle design predicted neither foam debris problems nor poor sealing action of the Solid Rocket Booster joints. To experience either on a mission was a violation of design specifications. These anomalies were signals of potential danger, not something to be tolerated, but in both cases after the first incident the engineering analysis concluded that the design could tolerate the damage. These engineers decided to implement a temporary fix and/or accept the risk, and fly. For both O-rings and foam, that first decision was a turning point. It established a precedent for accepting, rather than eliminating, these technical deviations. As a result of this new classification, subsequent incidents of O-ring erosion or foam debris strikes were not defined as signals of danger, but as evidence that the design was now acting as predicted.”
“航天飞机的初始设计即没有预料到泡沫脱落问题也没有预料到固体火箭助推器连接密封性能这么差。把两者中任何一个带到任务中都违背了设计规范。这些反常情况是潜在危险的信号,而不是可以容忍的,但是在两个事故中第一次发现问题时工程分析的结论都是设计可以承受损伤。这些工程师们决定实施临时修补或接受风险,继续飞。对O型密封圈和泡沫,这个第一次决定都是转折点。它开创了接受而不是消除这些技术超差的先例。这些新分类的结果是此后的O型密封圈腐蚀或泡沫脱落都不被规定为危险信号,而是设计如预料的起作用的证据。”
“Engineers and managers incorporated worsening anomalies into the engineering experience base, which functioned as an elastic waistband, expanding to hold larger deviations from the original design. Anomalies that did not lead to catastrophic failure were treated as a source
of valid engineering data that justified further flights. These anomalies were translated into a safety margin that was extremely influential, allowing engineers and managers to add incrementally to the amount and seriousness of damage that was acceptable. Both O-ring erosion and foam debris events were repeatedly “addressed” in NASAʼs Flight Readiness
Reviews but never fully resolved. In both cases, the engineering analysis was incomplete and inadequate. Engineers understood what was happening, but they never understood why. NASA continued to implement a series of small corrective actions, living with the problems until it was too late.”
“工程师们和经理们把恶化的反常归纳入工程经验数据库,成了弹性腰带,可以扩大来接受对初始设计更大的超差。没有导致灾难性失效的反常都成了有效的工程数据用于确定今后的飞行可行。这些反常被融入安全裕度有极大的影响,允许工程师们和经理们一点点增加了不可接受的危险的严重性和数量。O型密封圈和泡沫脱落事件在NASA的飞行可行性审查中反复被“强调”,但是从来没有彻底解决。两个事故中,工程分析都不完备和充分。工程师们知道会发生什么,但是他们都不知道为什么。NASA继续实施小打小闹的修补,直到问题发展到回天无力。”
“But it was a reverse of the usual circumstance: instead of having to prove it was safe to fly, they were asked to prove that it was unsafe to fly.”
“但是它与正常的情况相反:不是要求(工程师们)证明飞行是安全的,他们被要求证明飞行是不安全的。”(这就是说如果出现了违反设计要求的超差,工程师们如果不能证明这些超差不安全,就要同意飞行,带着不确定的潜在危险飞行。)
“NASA administrators, reacting to government pressures, transferred more functions and responsibilities to the private sector. The change was cost-efficient, but personnel cuts reduced oversight of contractors at the same time that the agency's dependence upon contractor engineering judgment increased.When high-risk technology is the product and lives
are at stake, safety, oversight, and communication flows are critical. The Board found that the Shuttle Program's normal chain of command and matrix system did not perform a check-and-balance function on either foam or O-rings.”
“NASA 当局屈从与政府的压力,把很多职能和责任转包给了私营企业。这个转变成本效益高,但是人员缩减削弱了对承包商的监督同时也削弱了机构对承包商日益增长的工程判断的独立性。一旦高风险技术是产品以及生命受到威胁,安全、监督和交流就至关重要了。无论是泡沫还是O型密封圈,调查委员会发现航天飞机项目的正常指令链和网状系统都没有履行监督制约职能。”
“The NASA culture encouraged flying with flaws because the schedule could not be held up for routine problems that were not defined as a threat to mission safety.”
“NASA的文化鼓励了带着缺欠飞行,因为时间表是不能因为没有被认定为威胁任务安全的日常问题改变。”
“Safety oriented organizations often build in checks and balances to identify and monitor signals of potential danger. If these checks and balances were in place in the Shuttle Program, they weren't working.”
“面向安全的组织通常有内部监督制约机制来发现和监督潜在的威胁信号。如果航天飞机项目有这些监督制约机制,它们也不会发挥作用”
“Pre-Challenger budget shortages resulted in safety personnel cutbacks. Without clout or independence, the safety personnel who remained were ineffective.”
“挑战者号之前的预算削减导致了安全人员的削减。没有权力或独立性,留下的安全人员也无法有效工作。”
“This can-do attitude bolstered administrators’ belief in an achievable launch rate, the belief that they had an operational system, and an unwillingness to listen to outside experts.”
“这种无所不能的态度助长了NASA当局对可取得的发射率的盲目自信,他们盲目地认为他们有一个运营系统(而不是研发中的系统),不愿意听取外部专家们的意见。”
“The Board found that even after the loss of Challenger, NASA was guilty of treating an experimental vehicle as if it were operational and of not listening to outside experts.”
“调查委员会发现即使在挑战者号事故之后,NASA仍然对把一个实验性的工具当做运营性并且不听取外部专家们的意见负有责任。”
“First, if success is measured by launches and landings, the machine appeared to be working successfully prior to both accidents.Challenger was the 25th launch. Seventeen years and 87 missions passed without major incident. Second, previous policy decisions again had an impact. NASA's Apollo-era research and development culture and its prized deference
to the technical expertise of its working engineers was overridden in the Space Shuttle era by “bureaucratic accountability” – an allegiance to hierarchy, procedure, and following the chain of command.”
“首先,如果成功是用发射和返回的次数来度量的,这种机器看来在两起事故之前都是成功的。挑战者号是在第25次发射出的事故。此后17年87次飞行都没有重大事故。其次,以前的政策决策再次起了作用。NASA的阿波罗时代研发文化和它的令人交口称赞的对它的工程师们的专业技术知识的尊重被航天飞机时代的‘官僚主义责任制’--对组织层次,程序和指令链的遵循的愚忠--践踏了。”
“These engineers could not prove that foam strikes and cold temperatures were unsafe, even though the previous analyses that declared them safe had been incomplete and were based on insufficient data and testing.”
“这些工程师们不能证明泡沫撞击和寒冷温度是不安全因素,尽管先前的宣布它们安全的分析是不完备的和建立在不充分的数据和实验基础上的。”
“The Space Shuttle Program had not produced good data on the correlation between cold temperature and O-ring resilience or good data on the potential effect of bipod ramp foam debris hits.”
“航天飞机项目没有获得寒冷温度和O型密封圈的弹性的关系或双杆斜面泡沫撞击的效果的可靠数据。”
“I had no data to quantify it. But I did say I knew it was away from
goodness in the current data base.”
“我没有数据把它量化。但是我的确说了我知道它与我们当前的数据库的可靠性偏离了。”--一位接受调查的工程师说。
简单地说就是挑战者号和哥伦比亚号在使用中都出现了设计规范不允许使用的超差,但是NASA的工程师们和经理们都没有遵守设计规范,在没有足够的依据证明飞行是安全的的情况下,继续飞行,直到发生机毁人亡的事故。
加载中,请稍候......