JP2025534223A

Movatterモバイル変換

Info

Publication number: JP2025534223A
Application number: JP2025514516A
Authority: JP
Inventors: ハンナビーゲル; ヨランダハガー; ケルシーエム．ロウピー; クレアパターソン; レイチェルエム．オストロフ; ローラメイサンプソン
Original assignee: Somalogic Inc
Current assignee: Somalogic Inc
Priority date: 2022-09-23
Filing date: 2023-09-22
Publication date: 2025-10-15

Abstract

本開示は、個体のタバコ使用状態を予測するためのバイオマーカー、方法、デバイス、試薬、システム、及びキットを含む。一態様において、本開示は、個体のタバコ使用状態を予測するために単独でまたは様々な組み合わせで使用され得るバイオマーカーを提供する。別の態様において、方法は、個体のタバコ使用状態の予測のために提供され、方法は、個体からの生物学的サンプル中の、表４中で提供されるバイオマーカーの群から選択される少なくとも１つのバイオマーカーに対応する少なくとも１つのバイオマーカー値を検出することを含む。The present disclosure includes biomarkers, methods, devices, reagents, systems, and kits for predicting tobacco use status in an individual. In one aspect, the disclosure provides biomarkers that can be used alone or in various combinations to predict tobacco use status in an individual. In another aspect, methods are provided for predicting tobacco use status in an individual, the method comprising detecting at least one biomarker value corresponding to at least one biomarker selected from the group of biomarkers provided in Table 4 in a biological sample from the individual.

Description

Translated fromJapanese

関連出願の相互参照
本出願は、２０２２年９月２３日に出願された米国仮出願第６３／４０９，３３８号の優先権の利益を主張し、同出願は、参照することによりその全体がすべての目的のために本明細書に援用される。 CROSS-REFERENCE TO RELATED APPLICATIONS This application claims the benefit of priority to U.S. Provisional Application No. 63/409,338, filed September 23, 2022, which is incorporated herein by reference in its entirety for all purposes.

本出願は、概して、個体のタバコ使用状態を予測するバイオマーカーの検出及び方法、ならびにより具体的には、個体のタバコ使用状態を予測するために使用される１つまたは複数のバイオマーカー、方法、デバイス、試薬、システム、及びキットに関する。This application relates generally to detection and methods of biomarkers that predict tobacco use status in an individual, and more specifically to one or more biomarkers, methods, devices, reagents, systems, and kits used to predict tobacco use status in an individual.

以下の記載は、本出願に関連し得る情報の要約を提供するものでああり、本明細書において提供される情報または参照される出版物のうちの任意のものが本出願の先行技術であると承認するものではない。The following provides a summary of information that may be relevant to the present application and is not an admission that any of the information provided herein or any of the publications referenced herein is prior art to the present application.

ＣｅｎｔｅｒｓｆｏｒＤｉｓｅａｓｅＣｏｎｔｒｏｌａｎｄＰｒｅｖｅｎｔｉｏｎ（ＣＤＣ）によれば、タバコを吸うことは、米国における毎年４８０，０００人超の死亡の直接原因である。喫煙は、生活の質を減少させる多数の慣性的な健康状態（心血管性疾患、肺疾患、及び肺癌が挙げられる）と強く相関し、喫煙の量及び頻度は、疾患のリスクの増加に正比例する（ＢａｎｋｓＥ，ｅｔａｌ．，ＢＭＣＭｅｄ，２０１９；１７（１）：１２８）。４０歳前の禁煙は、喫煙関連疾患で死亡するリスクを９０％まで低減することができ、したがって、Ｕ．Ｓ．ＤｅｐａｒｔｍｅｎｔｏｆＨｅａｌｔｈａｎｄＳｅｒｖｉｃｅｓは、ほぼ６０年間多量の喫煙の中断を奨励している（ＢｒａｗｌｅｙＯＷ，ｅｔａｌ．ＣＡＣａｎｃｅｒＪＣｌｉｎ，２０１４；６４（１）：５－８）。「以前の」喫煙者は、「１度も喫煙したことがない」者に比べて併存症を発症するリスクもより高い（ＫｒａｍａｒｏｗＥＡ．Ｈｅａｌｔｈｏｆｆｏｒｍｅｒｃｉｇａｒｅｔｔｅｓｍｏｋｅｒｓａｇｅｄ６５ａｎｄｏｖｅｒ：ＵｎｉｔｅｄＳｔａｔｅｓ，２０１８．ＮａｔｉｏｎａｌＣｅｎｔｅｒｆｏｒＨｅａｌｔｈＳｔａｔｉｓｔｉｃｓ．Ｊｕｌｙ２０２０．同文献は、ｈｔｔｐｓ：／／ｗｗｗ．ｃｄｃ．ｇｏｖ／ｎｃｈｓ／ｄａｔａ／ｎｈｓｒ／ｎｈｓｒ１４５－５０８．ｐｄｆ．から利用可能）。According to the Centers for Disease Control and Prevention (CDC), tobacco smoking is the direct cause of more than 480,000 deaths each year in the United States. Smoking is strongly correlated with numerous chronic health conditions that reduce quality of life, including cardiovascular disease, pulmonary disease, and lung cancer, and the amount and frequency of smoking are directly proportional to the increased risk of disease (Banks E, et al., BMC Med, 2019;17(1):128). Quitting smoking before age 40 can reduce the risk of dying from smoking-related diseases by up to 90%, thus reducing the risk of death from U.S. The Department of Health and Services has been encouraging heavy smoking cessation for nearly 60 years (Brawley OW, et al. CA Cancer J Clin, 2014;64(1):5-8). Former smokers are also at higher risk of developing comorbidities than never smokers (Kramarow EA. Health of former cigarette smokers aged 65 and over: United States, 2018. National Center for Health Statistics. July 2020. Available at https://www.cdc.gov/nchs/data/nhsr/nhsr145-508.pdf).

ＴｈｅＣｌｉｎｉｃａｌＰｒａｃｔｉｃｅＧｕｉｄｅｌｉｎｅＴｒｅａｔｉｎｇＴｏｂａｃｃｏＵｓｅａｎｄＤｅｐｅｎｄｅｎｃｅ２００８ＵｐｄａｔｅＰａｎｅｌ，Ｌｉａｉｓｏｎｓ，ａｎｄＳｔａｆｆによって記載される現在の標準的な医療ガイドラインは、プライマリケア提供者によるタバコ介入についての５Ａ：「（１）タバコを使用しているかどうかを患者に尋ねる（ａｓｋ）、（２）中止するに助言する（ａｄｖｉｓｅ）、（３）中止しようとする意思を査定する（ａｓｓｅｓｓ）、（４）中止しようとする意思のある者を支援する（ａｓｓｉｓｔ）、及び（５）後戻りを予防するためにフォローアップの連絡を手配する（ａｒｒａｎｇｅ）」を推奨している（ＴｈｅＣｌｉｎｉｃａｌＰｒａｃｔｉｃｅＧｕｉｄｅｌｉｎｅＴｒｅａｔｉｎｇＴｏｂａｃｃｏＵｓｅａｎｄＤｅｐｅｎｄｅｎｃｅ２００８ＵｐｄａｔｅＰａｎｅｌ，Ｌｉａｉｓｏｎｓ，ａｎｄＳｔａｆｆ．ＡＣｌｉｎｉｃａｌＰｒａｃｔｉｃｅＧｕｉｄｅｌｉｎｅｆｏｒＴｒｅａｔｉｎｇＴｏｂａｃｃｏＵｓｅａｎｄＤｅｐｅｎｄｅｎｃｅ：２００８Ｕｐｄａｔｅ，ＡＵ．Ｓ．ＰｕｂｌｉｃＨｅａｌｔｈＳｅｒｖｉｃｅＲｅｐｏｒｔ．ＡｍＪＰｒｅｖＭｅｄ，２００８；３５（２）：１５８－１７６）。実際面では、患者の喫煙関連疾患のリスク及びスクリーニングについて適格性を査定するために、臨床医は、多くの場合患者の自己申告の喫煙習慣に頼る。二次的に、生化学的尺度（コチニン、一酸化炭素、またはチオシアン酸塩等）は、自己申告した喫煙習慣の正確性を評価するために使用され得るが、すべての喫煙の生化学的尺度は「現在の」喫煙習慣の査定に限定される（ＪａｒｖｉｓＭＪ，ｅｔａｌ．ＡｍＪＰｕｂｌｉｃＨｅａｌｔｈ，１９８７；７７：１４３５－１４３８．ＰａｔｒｉｃｋＤＬ，ｅｔａｌ．，ＡｍＪＰｕｂｌｉｃＨｅａｌｔｈ，１９９４；８４（７）：１０８６－１０９３）。例えば血液ベースの測定によってタバコ使用状態を査定する代替物の開発は、「以前の」タバコ使用者そして「現在の」タバコ使用者を包含する臨床的に妥当な集団を同定する能力を有する。The current standard of care guidelines, as outlined by the Clinical Practice Guideline Treating Tobacco Use and Dependence 2008 Update Panel, Liaisons, and Staff, recommend the 5 As for tobacco intervention by primary care providers: (1) ask the patient if they are using tobacco, (2) advise them to quit, (3) assess their intention to quit, (4) assist those who are willing to quit, and (5) arrange for follow-up contact to prevent relapse. Dependence 2008 Update Panel, Liaisons, and Staff. A Clinical Practice Guideline for Treating Tobacco Use and Dependence: 2008 Update, A U.S. Public Health Service Report. Am J Prev Med, 2008;35(2):158-176. In practice, clinicians often rely on patients' self-reported smoking habits to assess their risk for smoking-related diseases and their eligibility for screening. Secondarily, biochemical measures (such as cotinine, carbon monoxide, or thiocyanate) can be used to assess the accuracy of self-reported smoking habits, but all biochemical measures of smoking are limited to assessing "current" smoking habits (Jarvis MJ, et al. Am J Public Health, 1987;77:1435-1438. Patrick DL, et al. Am J Public Health, 1994;84(7):1086-1093). Development of surrogates to assess tobacco use status, for example, via blood-based measures, has the potential to identify clinically relevant populations that encompass "former" and "current" tobacco users.

最近、ＵｎｉｔｅｄＳｔａｔｅｓＰｒｅｖｅｎｔａｔｉｖｅＳｅｒｖｉｃｅｓＴａｓｋＦｏｒｃｅ（ＵＳＰＳＴＦ）は、行動的介入及び薬理学的介入が、妊娠していない個体における禁煙を有意に増加すると結論した（ＰａｔｎｏｄｅＣＤ，ｅｔａｌ．，ＪＡＭＡ，２０２１；３２５（３）：２８０－２９８）。喫煙を中止する意思のある患者のために、介入としてはカウンセリングまたは医薬物が挙げられ；中止する意思のない患者のために、臨床医は、中止を検討するように患者の意欲を起こさせる介入（情報の提供及び助言等）を遂行することが奨励される。最近喫煙を中止した患者のために、臨床医は、後戻りを予防するために患者を支援するように助言されるが、数年間喫煙を中止している患者においては、後戻りの防止は要求されない。Recently, the United States Preventative Services Task Force (USPSTF) concluded that behavioral and pharmacological interventions significantly increase smoking cessation in non-pregnant individuals (Patnode CD, et al., JAMA, 2021;325(3):280-298). For patients who intend to quit smoking, interventions include counseling or medication; for patients who do not intend to quit, clinicians are encouraged to implement interventions (such as providing information and advice) that motivate patients to consider quitting. For patients who have recently quit smoking, clinicians are advised to support patients in preventing relapse, although prevention of relapse is not required in patients who have quit smoking for several years.

複数の第一選択治療：ブプロピオンＳＲ、ニコチンガム、ニコチン吸入剤、ニコチンロゼンジ、ニコチン鼻腔スプレー、ニコチンパッチ、またはバレニクリンのうちの１つは、喫煙の中止を支援するために患者に処方され得る。第一選択治療についての禁忌（妊娠等）が同定されたか、または患者が第一選択治療に応答していない限り、第二選択治療（クロニジン及びノルトリプチリンが挙げられる）は推奨されない（ＴｈｅＣｌｉｎｉｃａｌＰｒａｃｔｉｃｅＧｕｉｄｅｌｉｎｅＴｒｅａｔｉｎｇＴｏｂａｃｃｏＵｓｅａｎｄＤｅｐｅｎｄｅｎｃｅ２００８ＵｐｄａｔｅＰａｎｅｌ，Ｌｉａｉｓｏｎｓ，ａｎｄＳｔａｆｆ．ＡＣｌｉｎｉｃａｌＰｒａｃｔｉｃｅＧｕｉｄｅｌｉｎｅｆｏｒＴｒｅａｔｉｎｇＴｏｂａｃｃｏＵｓｅａｎｄＤｅｐｅｎｄｅｎｃｅ：２００８Ｕｐｄａｔｅ，ＡＵ．Ｓ．ＰｕｂｌｉｃＨｅａｌｔｈＳｅｒｖｉｃｅＲｅｐｏｒｔ．ＡｍＪＰｒｅｖＭｅｄ，２００８；３５（２）：１５８－１７６）。One of several first-line treatments may be prescribed to patients to assist them in quitting smoking: bupropion SR, nicotine gum, nicotine inhalant, nicotine lozenges, nicotine nasal spray, nicotine patch, or varenicline. Second-line treatments (including clonidine and nortriptyline) are not recommended unless a contraindication to the first-line treatment (such as pregnancy) is identified or the patient has not responded to the first-line treatment (The Clinical Practice Guideline Treating Tobacco Use and Dependence 2008 Update Panel, Liaisons, and Staff. A Clinical Practice Guideline for Treating Tobacco Use and Dependence: 2008 Update, A U.S. Public Health Service Report. Am J Prev Med, 2008; 35(2): 158-176).

コチニン（ニコチン代謝産物）は、多くの場合、「現在の」喫煙習慣を同定するために使用されるが、「以前の」喫煙者を同定し得る臨床業務において使用されるバイオマーカーは現在のところない（ＣｅｎｔｅｒｓｆｏｒＤｉｓｅａｓｅＣｏｎｔｒｏｌａｎｄＰｒｅｖｅｎｔｉｏｎ．ＢｉｏｍｏｎｉｔｏｒｉｎｇＳｕｍｍａｒｙ：Ｃｏｔｉｎｉｎｅ．Ａｐｒｉｌ２０１７．同文献は、ｗｗｗ．ｃｄｃ．ｇｏｖ／ｂｉｏｍｏｎｉｔｏｒｉｎｇ／Ｃｏｔｉｎｉｎｅ＿ＢｉｏｍｏｎｉｔｏｒｉｎｇＳｕｍｍａｒｙ．ｈｔｍｌから利用可能）。特異的遺伝子の所在位置でのＤＮＡメチル化を測定するエピジェネティック検査は利用可能であり、ＢｅｈａｖｉｏｒａｌＤｉａｇｎｏｓｔｉｃｓ，ＬＬＣによるＳｍｏｋｅＳｉｇｎａｔｕｒｅ（登録商標）検査等であるが、市販されているように、この検査は、「現在の」喫煙者を「１度も喫煙したことがない」者から層別化することのみを請求する（ＰｈｉｌｉｂｅｒｔＲ，ｅｔａｌ．，ＦｒｏｎｔＧｅｎｅｔ，２０１８；９：１３７；ＤａｗｅｓＫ，ｅｔａｌ．ＳｃｉＲｅｐ，２０２１；１１（１）：２１６２７）。Cotinine (a nicotine metabolite) is often used to identify "current" smoking habits, but there are currently no biomarkers used in clinical practice that can identify "former" smokers (Centers for Disease Control and Prevention. Biomonitoring Summary: Cotinine. April 2017. The reference is available at www.cdc.gov/biomonitoring/Cotinine_BiomonitoringSummary.html). Epigenetic tests that measure DNA methylation at specific gene locations are available, such as the Smoke Signature® test by Behavioral Diagnostics, LLC; however, as commercially available, this test only claims to stratify "current" smokers from "never smokers" (Philibert R, et al., Front Genet, 2018;9:137; Dawes K, et al. Sci Rep, 2021;11(1):21627).

個体がこれまでにタバコ使用者であった確率を予測するためのプロテオームモデルの開発は、高度に所望されるだろう。かかるプロテオームモデルは、主観的な自己申告、誤解を招く自己申告、または偽造の可能性のある自己申告に依存すること無しに、現在または以前の（すなわち「これまでの」）タバコ使用習慣についての知見を提供するだろう。個体がこれまでにタバコ使用者であった確率の予測を可能にするバイオマーカー、方法、デバイス、試薬、システム、及びキットについての必要性が存在する。The development of proteomic models for predicting the probability that an individual is an ever tobacco user would be highly desirable. Such proteomic models would provide insight into current or former (i.e., "ever") tobacco use habits without relying on subjective, misleading, or potentially falsified self-reports. A need exists for biomarkers, methods, devices, reagents, systems, and kits that enable prediction of the probability that an individual is an ever tobacco user.

本出願は、個体がこれまでにタバコ使用者であった確率を予測するためのバイオマーカー、方法、試薬、デバイス、システム、及びキットを包含する。いくつかの実施形態において、個体がこれまでにタバコ使用者であった確率を予測する方法が提供される。いくつかの実施形態において、サンプル中のＮ個のバイオマーカータンパク質のレベルを検出する方法が提供される。This application encompasses biomarkers, methods, reagents, devices, systems, and kits for predicting the probability that an individual is a former tobacco user. In some embodiments, methods are provided for predicting the probability that an individual is a former tobacco user. In some embodiments, methods are provided for detecting the levels of N biomarker proteins in a sample.

いくつかの実施形態において、対象がこれまでにタバコ使用者であった確率を予測する方法であって、対象からのサンプル中のＥＰＨＡ６バイオマーカータンパク質のレベル及びＮ個のバイオマーカータンパク質の各々のレベルを検出することを含み、Ｎが、少なくとも１、少なくとも２、少なくとも３、少なくとも４、少なくとも５、少なくとも６、少なくとも７、少なくとも８、少なくとも９、少なくとも１０、少なくとも１１、少なくとも１２、少なくとも１３、少なくとも１４、少なくとも１５、少なくとも１６、少なくとも１７、少なくとも１８、少なくとも１９、少なくとも２０、少なくとも２１、少なくとも２２、少なくとも２３、少なくとも２４、少なくとも２５、少なくとも２６、少なくとも２７、少なくとも２８、少なくとも２９、少なくとも３０、少なくとも３１、少なくとも３２、少なくとも３３、少なくとも３４、少なくとも３５、少なくとも３６、少なくとも３７、少なくとも３８、または少なくとも３９であり、Ｎ個のバイオマーカータンパク質のうちの少なくとも１つが、胎盤型アルカリホスファターゼ、ＳＣＦ、ＭＡＳＰ３：重鎖、ＤＵＳ１０、ＩｇＧ４－κ、ＤＣＣ、ＲＣＡＮ３、ＣＤ２４８、ＥＤＩＬ３、レニン、ＰＳＰ－９４、ＰＩＧＲ、ＵＲＢ、アグリン、セクレトグロビンファミリー３Ａメンバー１、ＳＯＸ２、ＳＲＥＣ－ＩＩ、ＩＧＦＡＬＳ、ＳＬＰＩ、ＷＦＤＣ１、ＲＥＧ４、ＭＭＰ－８、ＬＰＬＣ１、ＮＥＴ４、ＰＳＰ、ＤＬＤＨ、ＴＣＰ１０、ＭＵＣ１８、ＴＡＧＬ、ＴＩＭＰ－４、ＦＡＭ３Ｂ、フィビュリン１、ＣＲＡＣ１、ＰＰＢＮ、６Ｃｋｉｎｅ、カテプシンＶ、ＨＥ４、及びＰＰ２ＡサブユニットＢから選択される、前記方法が提供される。In some embodiments, a method for predicting the probability that a subject is a former tobacco user comprises detecting a level of an EPHA6 biomarker protein and a level of each of N biomarker proteins in a sample from the subject, where N is at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least 21, at least 22, at least 23, at least 24, at least 25, at least 26, at least 27, at least 28, at least 29, at least 30, at least 31, at least 32, at least 33, at least 34, at least 35, at least 36, at least 37, at least 38, at least 39, at least 40, at least 41, at least 42, at least 43, at least 44, at least 45, at least 46, at least 47, at least 48, at least 49, at least 50, at least 51, at least 52, at least 53, at least 54, at least 55, at least 56, at least 57, at least 58, at least 59, at least 60, at least 61, at least 62, at least 63, at least 64, at least 65, at least 66, at least 67, at least 68, at least 69, at least 70, at least 71, at least 72, at least 73, at least 74, at least 75, at least 76, at least 77, at least 78, 2, at least 33, at least 34, at least 35, at least 36, at least 37, at least 38, or at least 39, and at least one of the N biomarker proteins is placental alkaline phosphatase, SCF, MASP3: heavy chain, DUS10, IgG4-κ, DCC, RCAN3, CD248, EDIL3, renin, PSP-94, PIGR, URB, agrin, or senescence. The method further provides that the target protein is selected from cletoglobin family 3A member 1, SOX2, SREC-II, IGFALS, SLPI, WFDC1, REG4, MMP-8, LPLC1, NET4, PSP, DLDH, TCP10, MUC18, TAGL, TIMP-4, FAM3B, fibulin 1, CRAC1, PPBN, 6Ckine, cathepsin V, HE4, and PP2A subunit B.

いくつかの実施形態において、サンプル中のＮ個のバイオマーカータンパク質のレベルを検出する方法であって、対象からサンプルを得ること及び対象からのサンプル中のＮ個のバイオマーカータンパク質の各々のレベルを検出することを含み、Ｎが、少なくとも１、少なくとも２、少なくとも３、少なくとも４、少なくとも５、少なくとも６、少なくとも７、少なくとも８、少なくとも９、少なくとも１０、少なくとも１１、少なくとも１２、少なくとも１３、少なくとも１４、少なくとも１５、少なくとも１６、少なくとも１７、少なくとも１８、少なくとも１９、少なくとも２０、少なくとも２１、少なくとも２２、少なくとも２３、少なくとも２４、少なくとも２５、少なくとも２６、少なくとも２７、少なくとも２８、少なくとも２９、少なくとも３０、少なくとも３１、少なくとも３２、少なくとも３３、少なくとも３４、少なくとも３５、少なくとも３６、少なくとも３７、少なくとも３８、または少なくとも３９であり、Ｎ個のバイオマーカータンパク質のうちの少なくとも１つが、ＥＰＨＡ６、胎盤型アルカリホスファターゼ、ＳＣＦ、ＭＡＳＰ３：重鎖、ＤＵＳ１０、ＩｇＧ４－κ、ＤＣＣ、ＲＣＡＮ３、ＣＤ２４８、ＥＤＩＬ３、レニン、ＰＳＰ－９４、ＰＩＧＲ、ＵＲＢ、アグリン、セクレトグロビンファミリー３Ａメンバー１、ＳＯＸ２、ＳＲＥＣ－ＩＩ、ＩＧＦＡＬＳ、ＳＬＰＩ、ＷＦＤＣ１、ＲＥＧ４、ＭＭＰ－８、ＬＰＬＣ１、ＮＥＴ４、ＰＳＰ、ＤＬＤＨ、ＴＣＰ１０、ＭＵＣ１８、ＴＡＧＬ、ＴＩＭＰ－４、ＦＡＭ３Ｂ、フィビュリン１、ＣＲＡＣ１、ＰＰＢＮ、６Ｃｋｉｎｅ、カテプシンＶ、ＨＥ４、及びＰＰ２ＡサブユニットＢから選択される、前記方法が提供される。In some embodiments, a method for detecting levels of N biomarker proteins in a sample includes obtaining a sample from a subject and detecting levels of each of N biomarker proteins in the sample from the subject, where N is at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least 21, at least 22, at least 23, at least 24, at least 25, at least 26, at least 27, at least 28, at least 29, at least 30, at least 31, at least 32, or at least At least one of the N biomarker proteins is EPHA6, placental alkaline phosphatase, SCF, MASP3: heavy chain, DUS10, IgG4-κ, DCC, RCAN3, CD248, EDIL3, renin, PSP-94, PIGR, URB, agrin, The method provides a method in which the target protein is selected from secretoglobin family 3A member 1, SOX2, SREC-II, IGFALS, SLPI, WFDC1, REG4, MMP-8, LPLC1, NET4, PSP, DLDH, TCP10, MUC18, TAGL, TIMP-4, FAM3B, fibulin 1, CRAC1, PPBN, 6Ckine, cathepsin V, HE4, and PP2A subunit B.

いくつかの実施形態において、対象がこれまでにタバコ使用者であった確率を予測する方法であって、対象からのサンプル中の胎盤型アルカリホスファターゼバイオマーカータンパク質のレベル及びＮ個のバイオマーカータンパク質の各々のレベルを検出することを含み、Ｎが、少なくとも１、少なくとも２、少なくとも３、少なくとも４、少なくとも５、少なくとも６、少なくとも７、少なくとも８、少なくとも９、少なくとも１０、少なくとも１１、少なくとも１２、少なくとも１３、少なくとも１４、少なくとも１５、少なくとも１６、少なくとも１７、少なくとも１８、少なくとも１９、少なくとも２０、少なくとも２１、少なくとも２２、少なくとも２３、少なくとも２４、少なくとも２５、少なくとも２６、少なくとも２７、少なくとも２８、少なくとも２９、少なくとも３０、少なくとも３１、少なくとも３２、少なくとも３３、少なくとも３４、少なくとも３５、少なくとも３６、少なくとも３７、少なくとも３８、または少なくとも３９であり、Ｎ個のバイオマーカータンパク質のうちの少なくとも１つが、ＥＰＨＡ６、ＳＣＦ、ＭＡＳＰ３：重鎖、ＤＵＳ１０、ＩｇＧ４－κ、ＤＣＣ、ＲＣＡＮ３、ＣＤ２４８、ＥＤＩＬ３、レニン、ＰＳＰ－９４、ＰＩＧＲ、ＵＲＢ、アグリン、セクレトグロビンファミリー３Ａメンバー１、ＳＯＸ２、ＳＲＥＣ－ＩＩ、ＩＧＦＡＬＳ、ＳＬＰＩ、ＷＦＤＣ１、ＲＥＧ４、ＭＭＰ－８、ＬＰＬＣ１、ＮＥＴ４、ＰＳＰ、ＤＬＤＨ、ＴＣＰ１０、ＭＵＣ１８、ＴＡＧＬ、ＴＩＭＰ－４、ＦＡＭ３Ｂ、フィビュリン１、ＣＲＡＣ１、ＰＰＢＮ、６Ｃｋｉｎｅ、カテプシンＶ、ＨＥ４、及びＰＰ２ＡサブユニットＢから選択される、前記方法が提供される。In some embodiments, a method for predicting the probability that a subject is a former tobacco user comprises detecting a level of placental alkaline phosphatase biomarker protein and a level of each of N biomarker proteins in a sample from the subject, where N is at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least 21, at least 22, at least 23, at least 24, at least 25, at least 26, at least 27, at least 28, at least 29, at least 30, at least 31, at least 32, at least 33, at least 34, at least 35, at least 36, at least 37, at least 38, or at least 39, and at least one of the N biomarker proteins is EPHA6, SCF, MASP3:heavy chain, DUS10, IgG4-κ, DCC, RCAN3, CD248, EDIL3, renin, PSP-94, PIGR, URB, agrin, or senescence. The method further provides that the target protein is selected from cletoglobin family 3A member 1, SOX2, SREC-II, IGFALS, SLPI, WFDC1, REG4, MMP-8, LPLC1, NET4, PSP, DLDH, TCP10, MUC18, TAGL, TIMP-4, FAM3B, fibulin 1, CRAC1, PPBN, 6Ckine, cathepsin V, HE4, and PP2A subunit B.

いくつかの実施形態において、対象がこれまでにタバコ使用者であった確率を予測する方法であって、対象からのサンプル中のＳＣＦバイオマーカータンパク質のレベル及びＮ個のバイオマーカータンパク質の各々のレベルを検出することを含み、Ｎが、少なくとも１、少なくとも２、少なくとも３、少なくとも４、少なくとも５、少なくとも６、少なくとも７、少なくとも８、少なくとも９、少なくとも１０、少なくとも１１、少なくとも１２、少なくとも１３、少なくとも１４、少なくとも１５、少なくとも１６、少なくとも１７、少なくとも１８、少なくとも１９、少なくとも２０、少なくとも２１、少なくとも２２、少なくとも２３、少なくとも２４、少なくとも２５、少なくとも２６、少なくとも２７、少なくとも２８、少なくとも２９、少なくとも３０、少なくとも３１、少なくとも３２、少なくとも３３、少なくとも３４、少なくとも３５、少なくとも３６、少なくとも３７、少なくとも３８、または少なくとも３９であり、Ｎ個のバイオマーカータンパク質のうちの少なくとも１つが、ＥＰＨＡ６、胎盤型アルカリホスファターゼ、ＭＡＳＰ３：重鎖、ＤＵＳ１０、ＩｇＧ４－κ、ＤＣＣ、ＲＣＡＮ３、ＣＤ２４８、ＥＤＩＬ３、レニン、ＰＳＰ－９４、ＰＩＧＲ、ＵＲＢ、アグリン、セクレトグロビンファミリー３Ａメンバー１、ＳＯＸ２、ＳＲＥＣ－ＩＩ、ＩＧＦＡＬＳ、ＳＬＰＩ、ＷＦＤＣ１、ＲＥＧ４、ＭＭＰ－８、ＬＰＬＣ１、ＮＥＴ４、ＰＳＰ、ＤＬＤＨ、ＴＣＰ１０、ＭＵＣ１８、ＴＡＧＬ、ＴＩＭＰ－４、ＦＡＭ３Ｂ、フィビュリン１、ＣＲＡＣ１、ＰＰＢＮ、６Ｃｋｉｎｅ、カテプシンＶ、ＨＥ４、及びＰＰ２ＡサブユニットＢから選択される、前記方法が提供される。In some embodiments, a method for predicting the probability that a subject is a former tobacco user comprises detecting a level of an SCF biomarker protein and a level of each of N biomarker proteins in a sample from the subject, where N is at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least 21, at least 22, at least 23, at least 24, at least 25, at least 26, at least 27, at least 28, at least 29, at least 30, at least 31, at least 32, at least 33, at least 34, at least 35, at least 36, at least 37, at least 38, or at least 39, and at least one of the N biomarker proteins is EPHA6, placental alkaline phosphatase, MASP3: heavy chain, DUS10, IgG4-κ, DCC, RCAN3, CD248, EDIL3, renin, PSP-94, PIGR, URB, agrin, or senescence. The method further provides that the target protein is selected from cletoglobin family 3A member 1, SOX2, SREC-II, IGFALS, SLPI, WFDC1, REG4, MMP-8, LPLC1, NET4, PSP, DLDH, TCP10, MUC18, TAGL, TIMP-4, FAM3B, fibulin 1, CRAC1, PPBN, 6Ckine, cathepsin V, HE4, and PP2A subunit B.

いくつかの実施形態において、対象がこれまでにタバコ使用者であった確率を予測する方法であって、対象からのサンプル中のＭＡＳＰ３：重鎖バイオマーカータンパク質のレベル及びＮ個のバイオマーカータンパク質の各々のレベルを検出することを含み、Ｎが、少なくとも１、少なくとも２、少なくとも３、少なくとも４、少なくとも５、少なくとも６、少なくとも７、少なくとも８、少なくとも９、少なくとも１０、少なくとも１１、少なくとも１２、少なくとも１３、少なくとも１４、少なくとも１５、少なくとも１６、少なくとも１７、少なくとも１８、少なくとも１９、少なくとも２０、少なくとも２１、少なくとも２２、少なくとも２３、少なくとも２４、少なくとも２５、少なくとも２６、少なくとも２７、少なくとも２８、少なくとも２９、少なくとも３０、少なくとも３１、少なくとも３２、少なくとも３３、少なくとも３４、少なくとも３５、少なくとも３６、少なくとも３７、少なくとも３８、または少なくとも３９であり、Ｎ個のバイオマーカータンパク質のうちの少なくとも１つが、ＥＰＨＡ６、胎盤型アルカリホスファターゼ、ＳＣＦ、ＤＵＳ１０、ＩｇＧ４－κ、ＤＣＣ、ＲＣＡＮ３、ＣＤ２４８、ＥＤＩＬ３、レニン、ＰＳＰ－９４、ＰＩＧＲ、ＵＲＢ、アグリン、セクレトグロビンファミリー３Ａメンバー１、ＳＯＸ２、ＳＲＥＣ－ＩＩ、ＩＧＦＡＬＳ、ＳＬＰＩ、ＷＦＤＣ１、ＲＥＧ４、ＭＭＰ－８、ＬＰＬＣ１、ＮＥＴ４、ＰＳＰ、ＤＬＤＨ、ＴＣＰ１０、ＭＵＣ１８、ＴＡＧＬ、ＴＩＭＰ－４、ＦＡＭ３Ｂ、フィビュリン１、ＣＲＡＣ１、ＰＰＢＮ、６Ｃｋｉｎｅ、カテプシンＶ、ＨＥ４、及びＰＰ２ＡサブユニットＢから選択される、前記方法が提供される。In some embodiments, a method for predicting the probability that a subject is a former tobacco user comprises detecting a level of a MASP3:heavy chain biomarker protein and a level of each of N biomarker proteins in a sample from the subject, where N is at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least 21, at least 22, at least 23, at least 24, at least 25, at least 26, at least 27, at least 28, at least 29, at least 30, at least 31, or at least At least 32, at least 33, at least 34, at least 35, at least 36, at least 37, at least 38, or at least 39, and at least one of the N biomarker proteins is EPHA6, placental alkaline phosphatase, SCF, DUS10, IgG4-κ, DCC, RCAN3, CD248, EDIL3, renin, PSP-94, PIGR, URB, agrin, or senescence. The method further provides that the target protein is selected from cletoglobin family 3A member 1, SOX2, SREC-II, IGFALS, SLPI, WFDC1, REG4, MMP-8, LPLC1, NET4, PSP, DLDH, TCP10, MUC18, TAGL, TIMP-4, FAM3B, fibulin 1, CRAC1, PPBN, 6Ckine, cathepsin V, HE4, and PP2A subunit B.

いくつかの実施形態において、対象がこれまでにタバコ使用者であった確率を予測する方法であって、対象からのサンプル中のＤＵＳ１０バイオマーカータンパク質のレベル及びＮ個のバイオマーカータンパク質の各々のレベルを検出することを含み、Ｎが、少なくとも１、少なくとも２、少なくとも３、少なくとも４、少なくとも５、少なくとも６、少なくとも７、少なくとも８、少なくとも９、少なくとも１０、少なくとも１１、少なくとも１２、少なくとも１３、少なくとも１４、少なくとも１５、少なくとも１６、少なくとも１７、少なくとも１８、少なくとも１９、少なくとも２０、少なくとも２１、少なくとも２２、少なくとも２３、少なくとも２４、少なくとも２５、少なくとも２６、少なくとも２７、少なくとも２８、少なくとも２９、少なくとも３０、少なくとも３１、少なくとも３２、少なくとも３３、少なくとも３４、少なくとも３５、少なくとも３６、少なくとも３７、少なくとも３８、または少なくとも３９であり、Ｎ個のバイオマーカータンパク質のうちの少なくとも１つが、ＥＰＨＡ６、胎盤型アルカリホスファターゼ、ＳＣＦ、ＭＡＳＰ３：重鎖、ＩｇＧ４－κ、ＤＣＣ、ＲＣＡＮ３、ＣＤ２４８、ＥＤＩＬ３、レニン、ＰＳＰ－９４、ＰＩＧＲ、ＵＲＢ、アグリン、セクレトグロビンファミリー３Ａメンバー１、ＳＯＸ２、ＳＲＥＣ－ＩＩ、ＩＧＦＡＬＳ、ＳＬＰＩ、ＷＦＤＣ１、ＲＥＧ４、ＭＭＰ－８、ＬＰＬＣ１、ＮＥＴ４、ＰＳＰ、ＤＬＤＨ、ＴＣＰ１０、ＭＵＣ１８、ＴＡＧＬ、ＴＩＭＰ－４、ＦＡＭ３Ｂ、フィビュリン１、ＣＲＡＣ１、ＰＰＢＮ、６Ｃｋｉｎｅ、カテプシンＶ、ＨＥ４、及びＰＰ２ＡサブユニットＢから選択される、前記方法が提供される。In some embodiments, a method for predicting the probability that a subject is a former tobacco user comprises detecting a level of a DUS10 biomarker protein and a level of each of N biomarker proteins in a sample from the subject, where N is at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least 21, at least 22, at least 23, at least 24, at least 25, at least 26, at least 27, at least 28, at least 29, at least 30, at least 31, at least 3 2, at least 33, at least 34, at least 35, at least 36, at least 37, at least 38, or at least 39, and at least one of the N biomarker proteins is EPHA6, placental alkaline phosphatase, SCF, MASP3: heavy chain, IgG4-κ, DCC, RCAN3, CD248, EDIL3, renin, PSP-94, PIGR, URB, agrin, or senescence. The method further provides that the target protein is selected from cletoglobin family 3A member 1, SOX2, SREC-II, IGFALS, SLPI, WFDC1, REG4, MMP-8, LPLC1, NET4, PSP, DLDH, TCP10, MUC18, TAGL, TIMP-4, FAM3B, fibulin 1, CRAC1, PPBN, 6Ckine, cathepsin V, HE4, and PP2A subunit B.

いくつかの実施形態において、対象がこれまでにタバコ使用者であった確率を予測する方法であって、対象からのサンプル中のＩｇＧ４－κバイオマーカータンパク質のレベル及びＮ個のバイオマーカータンパク質の各々のレベルを検出することを含み、Ｎが、少なくとも１、少なくとも２、少なくとも３、少なくとも４、少なくとも５、少なくとも６、少なくとも７、少なくとも８、少なくとも９、少なくとも１０、少なくとも１１、少なくとも１２、少なくとも１３、少なくとも１４、少なくとも１５、少なくとも１６、少なくとも１７、少なくとも１８、少なくとも１９、少なくとも２０、少なくとも２１、少なくとも２２、少なくとも２３、少なくとも２４、少なくとも２５、少なくとも２６、少なくとも２７、少なくとも２８、少なくとも２９、少なくとも３０、少なくとも３１、少なくとも３２、少なくとも３３、少なくとも３４、少なくとも３５、少なくとも３６、少なくとも３７、少なくとも３８、または少なくとも３９であり、Ｎ個のバイオマーカータンパク質のうちの少なくとも１つが、ＥＰＨＡ６、胎盤型アルカリホスファターゼ、ＳＣＦ、ＭＡＳＰ３：重鎖、ＤＵＳ１０、ＤＣＣ、ＲＣＡＮ３、ＣＤ２４８、ＥＤＩＬ３、レニン、ＰＳＰ－９４、ＰＩＧＲ、ＵＲＢ、アグリン、セクレトグロビンファミリー３Ａメンバー１、ＳＯＸ２、ＳＲＥＣ－ＩＩ、ＩＧＦＡＬＳ、ＳＬＰＩ、ＷＦＤＣ１、ＲＥＧ４、ＭＭＰ－８、ＬＰＬＣ１、ＮＥＴ４、ＰＳＰ、ＤＬＤＨ、ＴＣＰ１０、ＭＵＣ１８、ＴＡＧＬ、ＴＩＭＰ－４、ＦＡＭ３Ｂ、フィビュリン１、ＣＲＡＣ１、ＰＰＢＮ、６Ｃｋｉｎｅ、カテプシンＶ、ＨＥ４、及びＰＰ２ＡサブユニットＢから選択される、前記方法が提供される。In some embodiments, a method for predicting the probability that a subject is a former tobacco user comprises detecting a level of an IgG4-κ biomarker protein and a level of each of N biomarker proteins in a sample from the subject, where N is at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least 21, at least 22, at least 23, at least 24, at least 25, at least 26, at least 27, at least 28, at least 29, at least 30, at least 31, at least 32, at least 33, at least 34, at least 35, at least 36, at least 37, at least 38, or at least 39, and at least one of the N biomarker proteins is EPHA6, placental alkaline phosphatase, SCF, MASP3: heavy chain, DUS10, DCC, RCAN3, CD248, EDIL3, renin, PSP-94, PIGR, URB, agrin, or senescence. The method further provides that the target protein is selected from cletoglobin family 3A member 1, SOX2, SREC-II, IGFALS, SLPI, WFDC1, REG4, MMP-8, LPLC1, NET4, PSP, DLDH, TCP10, MUC18, TAGL, TIMP-4, FAM3B, fibulin 1, CRAC1, PPBN, 6Ckine, cathepsin V, HE4, and PP2A subunit B.

いくつかの実施形態において、対象がこれまでにタバコ使用者であった確率を予測する方法であって、対象からのサンプル中のＤＣＣバイオマーカータンパク質のレベル及びＮ個のバイオマーカータンパク質の各々のレベルを検出することを含み、Ｎが、少なくとも１、少なくとも２、少なくとも３、少なくとも４、少なくとも５、少なくとも６、少なくとも７、少なくとも８、少なくとも９、少なくとも１０、少なくとも１１、少なくとも１２、少なくとも１３、少なくとも１４、少なくとも１５、少なくとも１６、少なくとも１７、少なくとも１８、少なくとも１９、少なくとも２０、少なくとも２１、少なくとも２２、少なくとも２３、少なくとも２４、少なくとも２５、少なくとも２６、少なくとも２７、少なくとも２８、少なくとも２９、少なくとも３０、少なくとも３１、少なくとも３２、少なくとも３３、少なくとも３４、少なくとも３５、少なくとも３６、少なくとも３７、少なくとも３８、または少なくとも３９であり、Ｎ個のバイオマーカータンパク質のうちの少なくとも１つが、ＥＰＨＡ６、胎盤型アルカリホスファターゼ、ＳＣＦ、ＭＡＳＰ３：重鎖、ＤＵＳ１０、ＩｇＧ４－κ、ＲＣＡＮ３、ＣＤ２４８、ＥＤＩＬ３、レニン、ＰＳＰ－９４、ＰＩＧＲ、ＵＲＢ、アグリン、セクレトグロビンファミリー３Ａメンバー１、ＳＯＸ２、ＳＲＥＣ－ＩＩ、ＩＧＦＡＬＳ、ＳＬＰＩ、ＷＦＤＣ１、ＲＥＧ４、ＭＭＰ－８、ＬＰＬＣ１、ＮＥＴ４、ＰＳＰ、ＤＬＤＨ、ＴＣＰ１０、ＭＵＣ１８、ＴＡＧＬ、ＴＩＭＰ－４、ＦＡＭ３Ｂ、フィビュリン１、ＣＲＡＣ１、ＰＰＢＮ、６Ｃｋｉｎｅ、カテプシンＶ、ＨＥ４、及びＰＰ２ＡサブユニットＢから選択される、前記方法が提供される。In some embodiments, a method for predicting the probability that a subject is a former tobacco user comprises detecting a level of a DCC biomarker protein and a level of each of N biomarker proteins in a sample from the subject, where N is at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least 21, at least 22, at least 23, at least 24, at least 25, at least 26, at least 27, at least 28, at least 29, at least 30, at least 31, at least 32, at least 33, at least 34, at least 35, at least 36, at least 37, at least 38, or at least 39, and at least one of the N biomarker proteins is EPHA6, placental alkaline phosphatase, SCF, MASP3: heavy chain, DUS10, IgG4-κ, RCAN3, CD248, EDIL3, renin, PSP-94, PIGR, URB, agrin, or senescence. The method further provides that the target protein is selected from cletoglobin family 3A member 1, SOX2, SREC-II, IGFALS, SLPI, WFDC1, REG4, MMP-8, LPLC1, NET4, PSP, DLDH, TCP10, MUC18, TAGL, TIMP-4, FAM3B, fibulin 1, CRAC1, PPBN, 6Ckine, cathepsin V, HE4, and PP2A subunit B.

いくつかの実施形態において、対象がこれまでにタバコ使用者であった確率を予測する方法であって、対象からのサンプル中のＲＣＡＮ３バイオマーカータンパク質のレベル及びＮ個のバイオマーカータンパク質の各々のレベルを検出することを含み、Ｎが、少なくとも１、少なくとも２、少なくとも３、少なくとも４、少なくとも５、少なくとも６、少なくとも７、少なくとも８、少なくとも９、少なくとも１０、少なくとも１１、少なくとも１２、少なくとも１３、少なくとも１４、少なくとも１５、少なくとも１６、少なくとも１７、少なくとも１８、少なくとも１９、少なくとも２０、少なくとも２１、少なくとも２２、少なくとも２３、少なくとも２４、少なくとも２５、少なくとも２６、少なくとも２７、少なくとも２８、少なくとも２９、少なくとも３０、少なくとも３１、少なくとも３２、少なくとも３３、少なくとも３４、少なくとも３５、少なくとも３６、少なくとも３７、少なくとも３８、または少なくとも３９であり、Ｎ個のバイオマーカータンパク質のうちの少なくとも１つが、ＥＰＨＡ６、胎盤型アルカリホスファターゼ、ＳＣＦ、ＭＡＳＰ３：重鎖、ＤＵＳ１０、ＩｇＧ４－κ、ＤＣＣ、ＣＤ２４８、ＥＤＩＬ３、レニン、ＰＳＰ－９４、ＰＩＧＲ、ＵＲＢ、アグリン、セクレトグロビンファミリー３Ａメンバー１、ＳＯＸ２、ＳＲＥＣ－ＩＩ、ＩＧＦＡＬＳ、ＳＬＰＩ、ＷＦＤＣ１、ＲＥＧ４、ＭＭＰ－８、ＬＰＬＣ１、ＮＥＴ４、ＰＳＰ、ＤＬＤＨ、ＴＣＰ１０、ＭＵＣ１８、ＴＡＧＬ、ＴＩＭＰ－４、ＦＡＭ３Ｂ、フィビュリン１、ＣＲＡＣ１、ＰＰＢＮ、６Ｃｋｉｎｅ、カテプシンＶ、ＨＥ４、及びＰＰ２ＡサブユニットＢから選択される、前記方法が提供される。In some embodiments, a method for predicting the probability that a subject is a former tobacco user comprises detecting a level of an RCAN3 biomarker protein and a level of each of N biomarker proteins in a sample from the subject, where N is at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least 21, at least 22, at least 23, at least 24, at least 25, at least 26, at least 27, at least 28, at least 29, at least 30, at least 31, at least 3 2, at least 33, at least 34, at least 35, at least 36, at least 37, at least 38, or at least 39, and at least one of the N biomarker proteins is EPHA6, placental alkaline phosphatase, SCF, MASP3: heavy chain, DUS10, IgG4-κ, DCC, CD248, EDIL3, renin, PSP-94, PIGR, URB, agrin, or senescence. The method further provides that the target protein is selected from cletoglobin family 3A member 1, SOX2, SREC-II, IGFALS, SLPI, WFDC1, REG4, MMP-8, LPLC1, NET4, PSP, DLDH, TCP10, MUC18, TAGL, TIMP-4, FAM3B, fibulin 1, CRAC1, PPBN, 6Ckine, cathepsin V, HE4, and PP2A subunit B.

いくつかの実施形態において、対象がこれまでにタバコ使用者であった確率を予測する方法であって、対象からのサンプル中のＣＤ２４８バイオマーカータンパク質のレベル及びＮ個のバイオマーカータンパク質の各々のレベルを検出することを含み、Ｎが、少なくとも１、少なくとも２、少なくとも３、少なくとも４、少なくとも５、少なくとも６、少なくとも７、少なくとも８、少なくとも９、少なくとも１０、少なくとも１１、少なくとも１２、少なくとも１３、少なくとも１４、少なくとも１５、少なくとも１６、少なくとも１７、少なくとも１８、少なくとも１９、少なくとも２０、少なくとも２１、少なくとも２２、少なくとも２３、少なくとも２４、少なくとも２５、少なくとも２６、少なくとも２７、少なくとも２８、少なくとも２９、少なくとも３０、少なくとも３１、少なくとも３２、少なくとも３３、少なくとも３４、少なくとも３５、少なくとも３６、少なくとも３７、少なくとも３８、または少なくとも３９であり、Ｎ個のバイオマーカータンパク質のうちの少なくとも１つが、ＥＰＨＡ６、胎盤型アルカリホスファターゼ、ＳＣＦ、ＭＡＳＰ３：重鎖、ＤＵＳ１０、ＩｇＧ４－κ、ＤＣＣ、ＲＣＡＮ３、ＥＤＩＬ３、レニン、ＰＳＰ－９４、ＰＩＧＲ、ＵＲＢ、アグリン、セクレトグロビンファミリー３Ａメンバー１、ＳＯＸ２、ＳＲＥＣ－ＩＩ、ＩＧＦＡＬＳ、ＳＬＰＩ、ＷＦＤＣ１、ＲＥＧ４、ＭＭＰ－８、ＬＰＬＣ１、ＮＥＴ４、ＰＳＰ、ＤＬＤＨ、ＴＣＰ１０、ＭＵＣ１８、ＴＡＧＬ、ＴＩＭＰ－４、ＦＡＭ３Ｂ、フィビュリン１、ＣＲＡＣ１、ＰＰＢＮ、６Ｃｋｉｎｅ、カテプシンＶ、ＨＥ４、及びＰＰ２ＡサブユニットＢから選択される、前記方法が提供される。In some embodiments, a method for predicting the probability that a subject is a former tobacco user comprises detecting a level of CD248 biomarker protein and a level of each of N biomarker proteins in a sample from the subject, where N is at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least 21, at least 22, at least 23, at least 24, at least 25, at least 26, at least 27, at least 28, at least 29, at least 30, at least 31, at least 32, at least 33, at least 34, at least 35, at least 36, at least 37, at least 38, at least 39, at least 40, at least 41, at least 42, at least 43, at least 44, at least 45, at least 46, at least 47, at least 48, at least 49, at least 50, at least 51, at least 52, at least 53, at least 54, at least 55, at least 56, at least 57, at least 58, at least 59, at least 60, at least 61, at least 62, at least 63, at least 64, at least 65, at least 66, at least 67, at least 68, at least 69, at least 70, at least 71, at least 72, at least 73, at least 74, at least 75, at least 76, at least 77, at least 78, at 2, at least 33, at least 34, at least 35, at least 36, at least 37, at least 38, or at least 39, and at least one of the N biomarker proteins is EPHA6, placental alkaline phosphatase, SCF, MASP3: heavy chain, DUS10, IgG4-κ, DCC, RCAN3, EDIL3, renin, PSP-94, PIGR, URB, agrin, or senescence. The method further provides that the target protein is selected from cletoglobin family 3A member 1, SOX2, SREC-II, IGFALS, SLPI, WFDC1, REG4, MMP-8, LPLC1, NET4, PSP, DLDH, TCP10, MUC18, TAGL, TIMP-4, FAM3B, fibulin 1, CRAC1, PPBN, 6Ckine, cathepsin V, HE4, and PP2A subunit B.

いくつかの実施形態において、Ｎは２～３９であるか、またはＮは３～３９であるか、またはＮは４～３９であるか、またはＮは５～３９であるか、またはＮは６～３９であるか、またはＮは７～３９であるか、またはＮは８～３９であるか、またはＮは９～３９であるか、またはＮは１０～３９であるか、またはＮは１１～３９であるか、またはＮは１２～３９であるか、またはＮは１３～３９であるか、またはＮは１４～３９であるか、Ｎは１５～３９であるか、Ｎは１６～３９であるか、Ｎは１７～３９であるか、Ｎは１８～３９であるか、Ｎは１９～３９であるか、Ｎは２０～３９であるか、Ｎは２１～３９であるか、Ｎは２２～３９であるか、Ｎは２３～３９であるか、Ｎは２４～３９であるか、Ｎは２５～３９であるか、Ｎは２６～３９であるか、Ｎは２７～３９であるか、Ｎは２８～３９であるか、Ｎは２９～３９であるか、Ｎは３０～３９であるか、Ｎは３１～３９であるか、Ｎは３２～３９であるか、Ｎは３３～３９であるか、Ｎは３４～３９であるか、Ｎは３５～３９であるか、Ｎは３６～３９であるか、Ｎは３７～３９であるか、またはＮは３８～３９である。いくつかの実施形態において、Ｎは２であるか、Ｎは３であるか、またはＮは４であるか、またはＮは５であるか、またはＮは６であるか、またはＮは７であるか、またはＮは８であるか、またはＮは９であるか、またはＮは１０であるか、またはＮは１１であるか、またはＮは１２であるか、またはＮは１３であるか、またはＮは１４であるか、またはＮは１５であるか、またはＮは１６であるか、またはＮは１７であるか、またはＮは１８であるか、またはＮは１９であるか、またはＮは２０であるか、またはＮは２１であるか、またはＮは２２であるか、またはＮは２３であるか、またはＮは２４であるか、Ｎは２５であるか、Ｎは２６であるか、Ｎは２７であるか、Ｎは２８であるか、Ｎは２９であるか、Ｎは３０であるか、Ｎは３１であるか、Ｎは３２であるか、Ｎは３３であるか、Ｎは３４であるか、Ｎは３５であるか、Ｎは３６であるか、Ｎは３７であるか、Ｎは３８であるか、またはＮは３９である。In some embodiments, N is 2-39, or N is 3-39, or N is 4-39, or N is 5-39, or N is 6-39, or N is 7-39, or N is 8-39, or N is 9-39, or N is 10-39, or N is 11-39, or N is 12-39, or N is 13-39, or N is 14-39, or N is 15-39, or N is 16-39, or N is 17-39, or N is 18-39 or N is 19-39, or N is 20-39, or N is 21-39, or N is 22-39, or N is 23-39, or N is 24-39, or N is 25-39, or N is 26-39, or N is 27-39, or N is 28-39, or N is 29-39, or N is 30-39, or N is 31-39, or N is 32-39, or N is 33-39, or N is 34-39, or N is 35-39, or N is 36-39, or N is 37-39, or N is 38-39. In some embodiments, N is 2, or N is 3, or N is 4, or N is 5, or N is 6, or N is 7, or N is 8, or N is 9, or N is 10, or N is 11, or N is 12, or N is 13, or N is 14, or N is 15, or N is 16, or N is 17, or N is 18, or N is 19, or N is 20, or N is 21, or N is 22, or N is 23, or N is 24, or N is 25, or N is 26, or N is 27, or N is 28, or N is 29, or N is 30, or N is 31, or N is 32, or N is 33, or N is 34, or N is 35, or N is 36, or N is 37, or N is 38, or N is 39.

いくつかの実施形態において、Ｎ個のバイオマーカータンパク質の各々は、ＥＰＨＡ６、胎盤型アルカリホスファターゼ、ＳＣＦ、ＭＡＳＰ３：重鎖、ＤＵＳ１０、ＩｇＧ４－κ、ＤＣＣ、ＲＣＡＮ３、ＣＤ２４８、ＥＤＩＬ３、レニン、ＰＳＰ－９４、ＰＩＧＲ、ＵＲＢ、アグリン、セクレトグロビンファミリー３Ａメンバー１、ＳＯＸ２、ＳＲＥＣ－ＩＩ、ＩＧＦＡＬＳ、ＳＬＰＩ、ＷＦＤＣ１、ＲＥＧ４、ＭＭＰ－８、ＬＰＬＣ１、ＮＥＴ４、ＰＳＰ、ＤＬＤＨ、ＴＣＰ１０、ＭＵＣ１８、ＴＡＧＬ、ＴＩＭＰ－４、ＦＡＭ３Ｂ、フィビュリン１、ＣＲＡＣ１、ＰＰＢＮ、６Ｃｋｉｎｅ、カテプシンＶ、ＨＥ４、及びＰＰ２ＡサブユニットＢから選択される。いくつかの実施形態において、Ｎ個のバイオマーカータンパク質のうちの少なくとも１つは、ＥＰＨＡ６、胎盤型アルカリホスファターゼ、ＳＣＦ、ＭＡＳＰ３：重鎖、ＤＵＳ１０、ＩｇＧ４－κ、ＤＣＣ、ＲＣＡＮ３、及びＣＤ２４８から選択される。いくつかの実施形態において、Ｎ個のタンパク質バイオマーカーのうちの少なくとも２、少なくとも３、少なくとも４、少なくとも５、少なくとも６、少なくとも７、少なくとも８、または少なくとも９は、ＥＰＨＡ６、胎盤型アルカリホスファターゼ、ＳＣＦ、ＭＡＳＰ３：重鎖、ＤＵＳ１０、ＩｇＧ４－κ、ＤＣＣ、ＲＣＡＮ３、及びＣＤ２４８から選択される。In some embodiments, each of the N biomarker proteins is selected from EPHA6, placental alkaline phosphatase, SCF, MASP3: heavy chain, DUS10, IgG4-κ, DCC, RCAN3, CD248, EDIL3, renin, PSP-94, PIGR, URB, agrin, secretoglobin family 3A member 1, SOX2, SREC-II, IGFALS, SLPI, WFDC1, REG4, MMP-8, LPLC1, NET4, PSP, DLDH, TCP10, MUC18, TAGL, TIMP-4, FAM3B, fibulin 1, CRAC1, PPBN, 6Ckine, cathepsin V, HE4, and PP2A subunit B. In some embodiments, at least one of the N biomarker proteins is selected from EPHA6, placental alkaline phosphatase, SCF, MASP3:heavy chain, DUS10, IgG4-κ, DCC, RCAN3, and CD248. In some embodiments, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, or at least 9 of the N protein biomarkers are selected from EPHA6, placental alkaline phosphatase, SCF, MASP3:heavy chain, DUS10, IgG4-κ, DCC, RCAN3, and CD248.

いくつかの実施形態において、Ｎ個のバイオマーカータンパク質のうちの２つはＥＰＨＡ６及び胎盤型アルカリホスファターゼであるか、またはＮ個のバイオマーカータンパク質のうちの２つはＥＰＨＡ６及びＳＣＦであるか、またはＮ個のバイオマーカータンパク質のうちの２つはＥＰＨＡ６及びＭＡＳＰ３：重鎖であるか、またはＮ個のバイオマーカータンパク質のうちの２つはＥＰＨＡ６及びＤＵＳ１０であるか、またはＮ個のバイオマーカータンパク質のうちの２つはＥＰＨＡ６及びＩｇＧ４－κであるか、またはＮ個のバイオマーカータンパク質のうちの２つはＥＰＨＡ６及びＤＣＣであるか、またはＮ個のバイオマーカータンパク質のうちの２つはＥＰＨＡ６及びＲＣＡＮ３であるか、またはＮ個のバイオマーカータンパク質のうちの２つはＥＰＨＡ６及びＣＤ２４８である。いくつかの実施形態において、Ｎ個のバイオマーカータンパク質のうちの２つは胎盤型アルカリホスファターゼ及びＳＣＦであるか、またはＮ個のバイオマーカータンパク質のうちの２つは胎盤型アルカリホスファターゼ及びＭＡＳＰ３：重鎖であるか、またはＮ個のバイオマーカータンパク質のうちの２つは胎盤型アルカリホスファターゼ及びＤＵＳ１０であるか、またはＮ個のバイオマーカータンパク質のうちの２つは胎盤型アルカリホスファターゼ及びＩｇＧ４－κであるか、またはＮ個のバイオマーカータンパク質のうちの２つは胎盤型アルカリホスファターゼ及びＤＣＣであるか、またはＮ個のバイオマーカータンパク質のうちの２つは胎盤型アルカリホスファターゼ及びＲＣＡＮ３であるか、またはＮ個のバイオマーカータンパク質のうちの２つは胎盤型アルカリホスファターゼ及びＣＤ２４８である。いくつかの実施形態において、Ｎ個のバイオマーカータンパク質のうちの２つはＳＣＦ及びＭＡＳＰ３：重鎖であるか、またはＮ個のバイオマーカータンパク質のうちの２つはＳＣＦ及びＤＵＳ１０であるか、またはＮ個のバイオマーカータンパク質のうちの２つはＳＣＦ及びＩｇＧ４－κであるか、またはＮ個のバイオマーカータンパク質のうちの２つはＳＣＦ及びＤＣＣであるか、またはＮ個のバイオマーカータンパク質のうちの２つはＳＣＦ及びＲＣＡＮ３であるか、またはＮ個のバイオマーカータンパク質のうちの２つはＳＣＦ及びＣＤ２４８である。いくつかの実施形態において、Ｎ個のバイオマーカータンパク質のうちの２つはＭＡＳＰ３：重鎖及びＤＵＳ１０であるか、またはＮ個のバイオマーカータンパク質のうちの２つはＭＡＳＰ３：重鎖及びＩｇＧ４－κであるか、またはＮ個のバイオマーカータンパク質のうちの２つはＭＡＳＰ３：重鎖及びＲＣＡＮ３であるか、またはＮ個のバイオマーカータンパク質のうちの２つはＭＡＳＰ３：重鎖及びＣＤ２４８であるか、またはＮ個のバイオマーカータンパク質のうちの２つはＭＡＳＰ３：重鎖及びＤＣＣである。いくつかの実施形態において、Ｎ個のバイオマーカータンパク質のうちの２つはＤＵＳ１０及びＩｇＧ４－κであるか、またはＮ個のバイオマーカータンパク質のうちの２つはＤＵＳ１０及びＤＣＣであるか、またはＮ個のバイオマーカータンパク質のうちの２つはＤＵＳ１０及びＣＤ２４８であるか、またはＮ個のバイオマーカータンパク質のうちの２つはＤＵＳ１０及びＲＣＡＮ３である。いくつかの実施形態において、Ｎ個のバイオマーカータンパク質のうちの２つはＩｇＧ４－κ及びＤＣＣであるか、またはＮ個のバイオマーカータンパク質のうちの２つはＩｇＧ４－κ及びＲＣＡＮ３であるか、またはＮ個のバイオマーカータンパク質のうちの２つはＩｇＧ４－κ及びＣＤ２４８である。いくつかの実施形態において、Ｎ個のバイオマーカータンパク質のうちの２つはＤＣＣ及びＲＣＡＮ３であるか、またはＮ個のバイオマーカータンパク質のうちの２つはＤＣＣ及びＣＤ２４８である。いくつかの実施形態において、Ｎ個のバイオマーカータンパク質のうちの２つはＲＣＡＮ３及びＣＤ２４８である。In some embodiments, two of the N biomarker proteins are EPHA6 and placental alkaline phosphatase, or two of the N biomarker proteins are EPHA6 and SCF, or two of the N biomarker proteins are EPHA6 and MASP3: heavy chain, or two of the N biomarker proteins are EPHA6 and DUS10, or two of the N biomarker proteins are EPHA6 and IgG4-κ, or two of the N biomarker proteins are EPHA6 and DCC, or two of the N biomarker proteins are EPHA6 and RCAN3, or two of the N biomarker proteins are EPHA6 and CD248. In some embodiments, two of the N biomarker proteins are placental alkaline phosphatase and SCF, or two of the N biomarker proteins are placental alkaline phosphatase and MASP3:heavy chain, or two of the N biomarker proteins are placental alkaline phosphatase and DUS10, or two of the N biomarker proteins are placental alkaline phosphatase and IgG4-κ, or two of the N biomarker proteins are placental alkaline phosphatase and DCC, or two of the N biomarker proteins are placental alkaline phosphatase and RCAN3, or two of the N biomarker proteins are placental alkaline phosphatase and CD248. In some embodiments, two of the N biomarker proteins are SCF and MASP3:heavy chain, or two of the N biomarker proteins are SCF and DUS10, or two of the N biomarker proteins are SCF and IgG4-κ, or two of the N biomarker proteins are SCF and DCC, or two of the N biomarker proteins are SCF and RCAN3, or two of the N biomarker proteins are SCF and CD248. In some embodiments, two of the N biomarker proteins are MASP3:heavy chain and DUS10, or two of the N biomarker proteins are MASP3:heavy chain and IgG4-κ, or two of the N biomarker proteins are MASP3:heavy chain and RCAN3, or two of the N biomarker proteins are MASP3:heavy chain and CD248, or two of the N biomarker proteins are MASP3:heavy chain and DCC. In some embodiments, two of the N biomarker proteins are DUS10 and IgG4-κ, or two of the N biomarker proteins are DUS10 and DCC, or two of the N biomarker proteins are DUS10 and CD248, or two of the N biomarker proteins are DUS10 and RCAN3. In some embodiments, two of the N biomarker proteins are IgG4-κ and DCC, or two of the N biomarker proteins are IgG4-κ and RCAN3, or two of the N biomarker proteins are IgG4-κ and CD248. In some embodiments, two of the N biomarker proteins are DCC and RCAN3, or two of the N biomarker proteins are DCC and CD248. In some embodiments, two of the N biomarker proteins are RCAN3 and CD248.

いくつかの実施形態において、対象がこれまでにタバコ使用者であった確率を予測する方法であって、Ｔ１１Ｌ１バイオマーカータンパク質のレベル検出することを含む、前記方法が提供される。いくつかの実施形態において、対象がこれまでにタバコ使用者であった確率を予測する方法であって、ＭＸＲＡ８バイオマーカータンパク質のレベル検出することを含む、前記方法が提供される。いくつかの実施形態において、対象がこれまでにタバコ使用者であった確率を予測する方法であって、ＸＴＰ３Ａバイオマーカータンパク質のレベル検出することを含む、前記方法が提供される。いくつかの実施形態において、対象がこれまでにタバコ使用者であった確率を予測する方法であって、ＣＲＡＣ１バイオマーカータンパク質のレベル検出することを含む、前記方法が提供される。いくつかの実施形態において、方法は、ＥＰＨＡ６、胎盤型アルカリホスファターゼ、ＳＣＦ、ＭＡＳＰ３：重鎖、ＤＵＳ１０、ＩｇＧ４－κ、ＤＣＣ、ＲＣＡＮ３、ＣＤ２４８、ＥＤＩＬ３、レニン、ＰＳＰ－９４、ＰＩＧＲ、ＵＲＢ、アグリン、セクレトグロビンファミリー３Ａメンバー１、ＳＯＸ２、ＳＲＥＣ－ＩＩ、ＩＧＦＡＬＳ、ＳＬＰＩ、ＷＦＤＣ１、ＲＥＧ４、ＭＭＰ－８、ＬＰＬＣ１、ＮＥＴ４、ＰＳＰ、ＤＬＤＨ、ＴＣＰ１０、ＭＵＣ１８、ＴＡＧＬ、ＴＩＭＰ－４、ＦＡＭ３Ｂ、フィビュリン１、ＣＲＡＣ１、ＰＰＢＮ、６Ｃｋｉｎｅ、カテプシンＶ、ＨＥ４、及びＰＰ２Ａサブユニットから選択される少なくとも１つのバイオマーカータンパク質を検出することをさらに含む。In some embodiments, methods are provided for predicting the probability that a subject is a former tobacco user, comprising detecting a level of a T11L1 biomarker protein. In some embodiments, methods are provided for predicting the probability that a subject is a former tobacco user, comprising detecting a level of an MXRA8 biomarker protein. In some embodiments, methods are provided for predicting the probability that a subject is a former tobacco user, comprising detecting a level of an XTP3A biomarker protein. In some embodiments, methods are provided for predicting the probability that a subject is a former tobacco user, comprising detecting a level of a CRAC1 biomarker protein. In some embodiments, the method further comprises detecting at least one biomarker protein selected from EPHA6, placental alkaline phosphatase, SCF, MASP3: heavy chain, DUS10, IgG4-κ, DCC, RCAN3, CD248, EDIL3, renin, PSP-94, PIGR, URB, agrin, secretoglobin family 3A member 1, SOX2, SREC-II, IGFALS, SLPI, WFDC1, REG4, MMP-8, LPLC1, NET4, PSP, DLDH, TCP10, MUC18, TAGL, TIMP-4, FAM3B, fibulin 1, CRAC1, PPBN, 6Ckine, cathepsin V, HE4, and PP2A subunit.

いくつかの実施形態において、サンプルは、血液サンプル、血漿サンプル、または血清サンプルである。いくつかの実施形態において、検出は、質量分析法、アプタマーベースのアッセイ、及び／または抗体ベースのアッセイを使用して遂行される。いくつかの実施形態において、方法は、サンプル（複数可）のバイオマーカータンパク質をバイオマーカー捕捉試薬のセットと接触させることを包含し、バイオマーカー捕捉試薬のセットのうちの各々のバイオマーカー捕捉試薬は、特異的に検出されている異なるバイオマーカータンパク質へ結合する。いくつかの実施形態において、各々のバイオマーカー捕捉試薬は、抗体またはアプタマーである。いくつかの実施形態において、各々のバイオマーカー捕捉試薬は、アプタマーである。いくつかの実施形態において、少なくとも１つのアプタマーは、解離速度の遅いアプタマーである。いくつかの実施形態において、少なくとも１つの解離速度の遅いアプタマーは、少なくとも１つ、少なくとも２つ、少なくとも３、少なくとも４、少なくとも５、少なくとも６、少なくとも７、少なくとも８、少なくとも９、または少なくとも１０の修飾を備えたヌクレオチドを含む。いくつかの実施形態において、各々の解離速度の遅いアプタマーは、≧２０分、≧３０分、≧６０分、≧９０分、≧１２０分、≧１５０分、≧１８０分、≧２１０分、または≧２４０分の解離速度（ｔ_１／２）で、その標的タンパク質へ結合する。いくつかの実施形態において、測定された各々のバイオマーカータンパク質のレベルは、相対蛍光単位（ＲＦＵ）またはタンパク質濃度から決定される。 In some embodiments, the sample is a blood sample, a plasma sample, or a serum sample. In some embodiments, detection is accomplished using mass spectrometry, an aptamer-based assay, and/or an antibody-based assay. In some embodiments, the method comprises contacting biomarker proteins of the sample(s) with a set of biomarker capture reagents, wherein each biomarker capture reagent of the set of biomarker capture reagents specifically binds to a different biomarker protein being detected. In some embodiments, each biomarker capture reagent is an antibody or an aptamer. In some embodiments, each biomarker capture reagent is an aptamer. In some embodiments, at least one aptamer is a slow-off aptamer. In some embodiments, the at least one slow-off aptamer comprises at least one, at least two, at least three, at least four, at least five, at least six, at least seven, at least eight, at least nine, or at least ten nucleotides with modifications. In some embodiments, each slow off-rate aptamer binds to its target protein with a off-rate (t_1/2 ) of ≧20 minutes, ≧30 minutes, ≧60 minutes, ≧90 minutes, ≧120 minutes, ≧150 minutes, ≧180 minutes, ≧210 minutes, or ≧240 minutes. In some embodiments, the level of each measured biomarker protein is determined from relative fluorescence units (RFU) or protein concentration.

いくつかの実施形態において、対象がこれまでにタバコ使用者であった確率の予測は、統計モデルにおいて測定されたＮ個のバイオマーカータンパク質のレベルの入力に基づく。いくつかの実施形態において、決定は、エラスティックネットロジスティック回帰モデルを使用して、Ｎ個のバイオマーカータンパク質のレベルを分析することを含む。いくつかの実施形態において、モデルは、少なくとも０．６５、少なくとも０．６６、少なくとも０．６７、少なくとも０．６８、少なくとも０．６９、少なくとも０．７、少なくとも０．７５、少なくとも０．８、少なくとも０．８５、少なくとも０．９、または少なくとも０．９５から選択される曲線下面積（ＡＵＣ）を有する。いくつかの実施形態において、モデルは、これまでにタバコ使用者であった絶対リスク確率を提供する。いくつかの実施形態において、モデルは、０～１の間の値を提供し、そこで、＞０．４６０の値は、これまでにタバコ使用者であることを予測するものである。いくつかの実施形態において、モデルは、ＥＰＨＡ６、胎盤型アルカリホスファターゼ、ＳＣＦ、ＭＡＳＰ３：重鎖、ＤＵＳ１０、ＩｇＧ４－κ、ＤＣＣ、ＲＣＡＮ３、及びＣＤ２４８から選択されるバイオマーカータンパク質の各々のレベルに基づいて、これまでにタバコ使用者であった絶対確率を提供する。いくつかの実施形態において、モデルは、ＥＰＨＡ６、胎盤型アルカリホスファターゼ、ＳＣＦ、ＭＡＳＰ３：重鎖、ＤＵＳ１０、ＩｇＧ４－κ、ＤＣＣ、ＲＣＡＮ３、ＣＤ２４８、ＥＤＩＬ３、レニン、ＰＳＰ－９４、ＰＩＧＲ、ＵＲＢ、アグリン、セクレトグロビンファミリー３Ａメンバー１、ＳＯＸ２、ＳＲＥＣ－ＩＩ、ＩＧＦＡＬＳ、ＳＬＰＩ、ＷＦＤＣ１、ＲＥＧ４、ＭＭＰ－８、ＬＰＬＣ１、ＮＥＴ４、ＰＳＰ、ＤＬＤＨ、ＴＣＰ１０、ＭＵＣ１８、ＴＡＧＬ、ＴＩＭＰ－４、ＦＡＭ３Ｂ、フィビュリン１、ＣＲＡＣ１、ＰＰＢＮ、６Ｃｋｉｎｅ、カテプシンＶ、ＨＥ４、及びＰＰ２ＡサブユニットＢから選択されるバイオマーカータンパク質の各々のレベルに基づいて、これまでにタバコ使用者であった絶対確率を提供する。In some embodiments, the prediction of the probability that a subject is an ever tobacco user is based on inputting the measured levels of N biomarker proteins into a statistical model. In some embodiments, the determination involves analyzing the levels of the N biomarker proteins using an elastic net logistic regression model. In some embodiments, the model has an area under the curve (AUC) selected from at least 0.65, at least 0.66, at least 0.67, at least 0.68, at least 0.69, at least 0.7, at least 0.75, at least 0.8, at least 0.85, at least 0.9, or at least 0.95. In some embodiments, the model provides an absolute risk probability of ever being a tobacco user. In some embodiments, the model provides a value between 0 and 1, where a value >0.460 is predictive of ever being a tobacco user. In some embodiments, the model provides an absolute probability of being an ever tobacco user based on the level of each of the biomarker proteins selected from EPHA6, placental alkaline phosphatase, SCF, MASP3:heavy chain, DUS10, IgG4-κ, DCC, RCAN3, and CD248. In some embodiments, the model provides an absolute probability of ever being a tobacco user based on the level of each of the biomarker proteins selected from EPHA6, placental alkaline phosphatase, SCF, MASP3:heavy chain, DUS10, IgG4-κ, DCC, RCAN3, CD248, EDIL3, renin, PSP-94, PIGR, URB, agrin, secretoglobin family 3A member 1, SOX2, SREC-II, IGFALS, SLPI, WFDC1, REG4, MMP-8, LPLC1, NET4, PSP, DLDH, TCP10, MUC18, TAGL, TIMP-4, FAM3B, fibulin 1, CRAC1, PPBN, 6Ckine, cathepsin V, HE4, and PP2A subunit B.

いくつかの実施形態において、方法は、医療保険料または生命保険料の決定の目的で、対象がこれまでにタバコ使用者であった確率を予測するために提供される。いくつかの実施形態において、方法は、医療保険または生命保険の補償範囲または保険料を決定することをさらに含む。いくつかの実施形態において、方法は、医療資源の利用を予測及び／または管理するために、方法からもたらされる情報を使用することをさらに含んで提供される。いくつかの実施形態において、方法は、医療事業、病院、または会社を取得または購入する決定を可能にするために、方法からもたらされる情報を使用することをさらに含む。実施形態のうちの任意のものにおいて、タバコ使用者は喫煙者である。In some embodiments, a method is provided for predicting the probability that a subject is a former tobacco user for purposes of determining health insurance or life insurance premiums. In some embodiments, the method further comprises determining health insurance or life insurance coverage or premiums. In some embodiments, a method is provided further comprising using information resulting from the method to predict and/or manage healthcare resource utilization. In some embodiments, the method further comprises using information resulting from the method to facilitate decisions to acquire or purchase healthcare businesses, hospitals, or companies. In any of the embodiments, the tobacco user is a smoker.

いくつかの実施形態において、Ｎ個のバイオマーカータンパク質捕捉試薬を含むキットであって、Ｎが、少なくとも１、少なくとも２、少なくとも３、少なくとも４、少なくとも５、少なくとも６、少なくとも７、少なくとも８、少なくとも９、少なくとも１０、少なくとも１１、少なくとも１２、少なくとも１３、少なくとも１４、少なくとも１５、少なくとも１６、少なくとも１７、少なくとも１８、少なくとも１９、少なくとも２０、少なくとも２１、少なくとも２２、少なくとも２３、少なくとも２４、少なくとも２５、少なくとも２６、少なくとも２７、少なくとも２８、少なくとも２９、少なくとも３０、少なくとも３１、少なくとも３２、少なくとも３３、少なくとも３４、少なくとも３５、少なくとも３６、少なくとも３７、少なくとも３８、または少なくとも３９であり、Ｎ個のバイオマーカータンパク質捕捉試薬の少なくとも１つが、ＥＰＨＡ６、胎盤型アルカリホスファターゼ、ＳＣＦ、ＭＡＳＰ３：重鎖、ＤＵＳ１０、ＩｇＧ４－κ、ＤＣＣ、ＲＣＡＮ３、ＣＤ２４８、ＥＤＩＬ３、レニン、ＰＳＰ－９４、ＰＩＧＲ、ＵＲＢ、アグリン、セクレトグロビンファミリー３Ａメンバー１、ＳＯＸ２、ＳＲＥＣ－ＩＩ、ＩＧＦＡＬＳ、ＳＬＰＩ、ＷＦＤＣ１、ＲＥＧ４、ＭＭＰ－８、ＬＰＬＣ１、ＮＥＴ４、ＰＳＰ、ＤＬＤＨ、ＴＣＰ１０、ＭＵＣ１８、ＴＡＧＬ、ＴＩＭＰ－４、ＦＡＭ３Ｂ、フィビュリン１、ＣＲＡＣ１、ＰＰＢＮ、６Ｃｋｉｎｅ、カテプシンＶ、ＨＥ４、及びＰＰ２ＡサブユニットＢから選択されるバイオマーカータンパク質へ特異的に結合する、前記キットが提供される。In some embodiments, a kit includes N biomarker protein capture reagents, where N is at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least 21, at least 22, at least 23, at least 24, at least 25, at least 26, at least 27, at least 28, at least 29, at least 30, at least 31, at least 32, at least 33, at least 34, at least 35, at least 36, at least 37, or at least 38 , or at least 39, and at least one of the N biomarker protein capture reagents is EPHA6, placental alkaline phosphatase, SCF, MASP3: heavy chain, DUS10, IgG4-κ, DCC, RCAN3, CD248, EDIL3, renin, PSP-94, PIGR, URB, agrin, secretoglobin family 3A member 1, SOX2, SREC-II The kit specifically binds to a biomarker protein selected from the group consisting of IGFALS, SLPI, WFDC1, REG4, MMP-8, LPLC1, NET4, PSP, DLDH, TCP10, MUC18, TAGL, TIMP-4, FAM3B, fibulin 1, CRAC1, PPBN, 6Ckine, cathepsin V, HE4, and PP2A subunit B.

いくつかの実施形態において、Ｎ個のバイオマーカータンパク質捕捉試薬を含むキットであって、本明細書において記載される方法のうちの任意のものを実行するためのバイオマーカータンパク質捕捉試薬を含む、前記キットが提供される。いくつかの実施形態において、Ｎ個のバイオマーカータンパク質捕捉試薬の各々は、抗体またはアプタマーである。いくつかの実施形態において、各々のバイオマーカータンパク質捕捉試薬は、アプタマーである。いくつかの実施形態において、少なくとも１つのアプタマーは、解離速度の遅いアプタマーである。いくつかの実施形態において、少なくとも１つの解離速度の遅いアプタマーは、少なくとも１つ、少なくとも２つ、少なくとも３、少なくとも４、少なくとも５、少なくとも６、少なくとも７、少なくとも８、少なくとも９、または少なくとも１０の修飾を備えたヌクレオチドを含む。いくつかの実施形態において、各々の解離速度の遅いアプタマーは、≧２０分、≧３０分、≧６０分、≧９０分、≧１２０分、≧１５０分、≧１８０分、≧２１０分、または≧２４０分の解離速度（ｔ_１／２）で、その標的タンパク質へ結合する。いくつかの実施形態において、キットは、対象からのサンプル中のＮ個のバイオマーカータンパク質の検出における使用ためのものである。いくつかの実施形態において、キットは、対象がこれまでにタバコ使用者であった確率の予測での使用における使用のためのものである。 In some embodiments, a kit is provided comprising N biomarker protein capture reagents for carrying out any of the methods described herein. In some embodiments, each of the N biomarker protein capture reagents is an antibody or an aptamer. In some embodiments, each biomarker protein capture reagent is an aptamer. In some embodiments, at least one aptamer is a slow-dissociating aptamer. In some embodiments, at least one slow-dissociating aptamer comprises at least one, at least two, at least three, at least four, at least five, at least six, at least seven, at least eight, at least nine, or at least ten modified nucleotides. In some embodiments, each slow-dissociating aptamer binds to its target protein with a dissociation rate (t_1/2 ) of ≥ 20 minutes, ≥ 30 minutes, ≥ 60 minutes, ≥ 90 minutes, ≥ 120 minutes, ≥ 150 minutes, ≥ 180 minutes, ≥ 210 minutes, or ≥ 240 minutes. In some embodiments, the kit is for use in detecting N biomarker proteins in a sample from a subject, hi some embodiments, the kit is for use in predicting the probability that a subject is an ever tobacco user.

訓練、検証、及び妥当性確認のデータセットに関するタバコ使用状態参照モデルについての受信者動作特性（ＲＯＣ）曲線を示す。1 shows receiver operating characteristic (ROC) curves for the tobacco use status reference model for the training, validation, and verification datasets.データセット（訓練、検証、または妥当性確認）によって層別化された予測確率のボックスプロット、及びタバコ使用が「これまでにあった」ｖｓ「１度もない」の分類を示す。水平方向の黒色線は０．４６の決定カットオフに対応する。Boxplots of predicted probabilities stratified by dataset (training, validation, or validation) and classification of tobacco use as "ever" vs. "never." The horizontal black line corresponds to a decision cutoff of 0.46.ウリジンの５位で存在し得る、アプタマー中の特定の例示的な修飾を示す。Ｃ－５修飾の化学構造は、ウリジンの５位への修飾を連結する例示的なアミドリンケージを含む。示される５位部分としては、ベンジル部分（例えばＢｎ、ＰＥ、及びＰＰ）、ナフチル部分（例えばＮａｐ、２Ｎａｐ、ＮＥ）、ブチル部分（例えばｉＢｕ）、フルオロベンジル部分（例えばＦＢｎ）、チロシル部分（例えばＴｙｒ）、３，４－メチレンジオキシベンジル（例えばＭＢｎ）、モルホリノ部分（例えばＭＯＥ）、ベンゾフラニル部分（例えばＢＦ）、インドール部分（例えばＴｒｐ）、及びヒドロキシプロピル部分（例えばＴｈｒ）が挙げられる。Certain exemplary modifications in aptamers that may be present at the 5-position of uridine are shown. The chemical structures of C-5 modifications include exemplary amide linkages connecting the modifications to the 5-position of uridine. The 5-position moieties shown include benzyl moieties (e.g., Bn, PE, and PP), naphthyl moieties (e.g., Nap, 2Nap, NE), butyl moieties (e.g., iBu), fluorobenzyl moieties (e.g., FBn), tyrosyl moieties (e.g., Tyr), 3,4-methylenedioxybenzyl (e.g., MBn), morpholino moieties (e.g., MOE), benzofuranyl moieties (e.g., BF), indole moieties (e.g., Trp), and hydroxypropyl moieties (e.g., Thr).ウリジンの５位で存在し得る、アプタマー中の特定の例示的な修飾を示す。Ｃ－５修飾の化学構造は、ウリジンの５位への修飾を連結する例示的なアミドリンケージを含む。示される５位部分としては、ベンジル部分（例えばＢｎ、ＰＥ、及びＰＰ）、ナフチル部分（例えばＮａｐ、２Ｎａｐ、ＮＥ）、ブチル部分（例えばｉＢｕ）、フルオロベンジル部分（例えばＦＢｎ）、チロシル部分（例えばＴｙｒ）、３，４－メチレンジオキシベンジル（例えばＭＢｎ）、モルホリノ部分（例えばＭＯＥ）、ベンゾフラニル部分（例えばＢＦ）、インドール部分（例えばＴｒｐ）、及びヒドロキシプロピル部分（例えばＴｈｒ）が挙げられる。Certain exemplary modifications in aptamers that may be present at the 5-position of uridine are shown. The chemical structures of C-5 modifications include exemplary amide linkages connecting the modifications to the 5-position of uridine. The 5-position moieties shown include benzyl moieties (e.g., Bn, PE, and PP), naphthyl moieties (e.g., Nap, 2Nap, NE), butyl moieties (e.g., iBu), fluorobenzyl moieties (e.g., FBn), tyrosyl moieties (e.g., Tyr), 3,4-methylenedioxybenzyl (e.g., MBn), morpholino moieties (e.g., MOE), benzofuranyl moieties (e.g., BF), indole moieties (e.g., Trp), and hydroxypropyl moieties (e.g., Thr).ウリジンの５位で存在し得る、アプタマー中の特定の例示的な修飾を示す。Ｃ－５修飾の化学構造は、ウリジンの５位への修飾を連結する例示的なアミドリンケージを含む。示される５位部分としては、ベンジル部分（例えばＢｎ、ＰＥ、及びＰＰ）、ナフチル部分（例えばＮａｐ、２Ｎａｐ、ＮＥ）、ブチル部分（例えばｉＢｕ）、フルオロベンジル部分（例えばＦＢｎ）、チロシル部分（例えばＴｙｒ）、３，４－メチレンジオキシベンジル（例えばＭＢｎ）、モルホリノ部分（例えばＭＯＥ）、ベンゾフラニル部分（例えばＢＦ）、インドール部分（例えばＴｒｐ）、及びヒドロキシプロピル部分（例えばＴｈｒ）が挙げられる。Certain exemplary modifications in aptamers that may be present at the 5-position of uridine are shown. The chemical structures of C-5 modifications include exemplary amide linkages connecting the modifications to the 5-position of uridine. The 5-position moieties shown include benzyl moieties (e.g., Bn, PE, and PP), naphthyl moieties (e.g., Nap, 2Nap, NE), butyl moieties (e.g., iBu), fluorobenzyl moieties (e.g., FBn), tyrosyl moieties (e.g., Tyr), 3,4-methylenedioxybenzyl (e.g., MBn), morpholino moieties (e.g., MOE), benzofuranyl moieties (e.g., BF), indole moieties (e.g., Trp), and hydroxypropyl moieties (e.g., Thr).シチジンの５位で存在し得る、アプタマー中の特定の例示的な修飾を示す。Ｃ－５修飾の化学構造は、シチジンの５位への修飾を連結する例示的なアミドリンケージを含む。示される５位部分としては、ベンジル部分（例えばＢｎ、ＰＥ、及びＰＰ）、ナフチル部分（例えばＮａｐ、２Ｎａｐ、ＮＥ、及び２ＮＥ）、及びチロシル部分（例えばＴｙｒ）が挙げられる。Certain exemplary modifications in aptamers that may occur at the 5-position of cytidine are shown. The chemical structures of C-5 modifications include exemplary amide linkages connecting the modifications to the 5-position of cytidine. The 5-position moieties shown include benzyl moieties (e.g., Bn, PE, and PP), naphthyl moieties (e.g., Nap, 2Nap, NE, and 2NE), and tyrosyl moieties (e.g., Tyr).シチジンの５位で存在し得る、アプタマー中の特定の例示的な修飾を示す。Ｃ－５修飾の化学構造は、シチジンの５位への修飾を連結する例示的なアミドリンケージを含む。示される５位部分としては、ベンジル部分（例えばＢｎ、ＰＥ、及びＰＰ）、ナフチル部分（例えばＮａｐ、２Ｎａｐ、ＮＥ、及び２ＮＥ）、及びチロシル部分（例えばＴｙｒ）が挙げられる。Certain exemplary modifications in aptamers that may occur at the 5-position of cytidine are shown. The chemical structures of C-5 modifications include exemplary amide linkages connecting the modifications to the 5-position of cytidine. The 5-position moieties shown include benzyl moieties (e.g., Bn, PE, and PP), naphthyl moieties (e.g., Nap, 2Nap, NE, and 2NE), and tyrosyl moieties (e.g., Tyr).本明細書において記載される様々なコンピューター実装方法と共に使用するための例示的なコンピューターシステムを図示する。1 illustrates an exemplary computer system for use with various computer-implemented methods described herein.１つまたは複数の実施形態に従って、個体がこれまでにタバコ使用者であった確率を評価する方法についてのフローチャートである。1 is a flowchart for a method of assessing the probability that an individual is a former tobacco user according to one or more embodiments.

本発明の代表的な実施形態が、ここで詳細に参照されるだろう。本発明は列挙された具体的な実施形態と共に記載されるだろうが、本発明はそれらの実施形態に限定されることを意図するものではないことが理解されるだろう。これとは反対に、本発明は、請求項によって定義される本発明の範囲内に包含され得るすべての代替物、修飾物、及び均等物をカバーすることが意図される。Reference will now be made in detail to exemplary embodiments of the present invention. While the invention will be described in conjunction with enumerated specific embodiments, it will be understood that it is not intended that the invention be limited to those embodiments. On the contrary, the invention is intended to cover all alternatives, modifications, and equivalents which may be included within the scope of the present invention as defined by the claims.

当業者は本明細書において記載されるものに類似または均等な多くの方法及び材料を認識し、それらは本発明の実践において使用され得、本発明の実践の範囲内である。本発明は、記載される方法及び材料に一切限定されない。Those skilled in the art will recognize many methods and materials similar or equivalent to those described herein, which could be used in the practice of the present invention and are within the scope of the present invention. The present invention is in no way limited to the methods and materials described.

別段定義されない限り、本明細書において使用される技術用語及び科学用語は、本発明が属する技術分野の当業者によって共通して理解されるものと同じ意味を有する。本発明の実践または検査において、本明細書において記載されるものに類似または均等の任意の方法、デバイス、及び材料が使用され得るが、特定の方法、デバイス、及び材料がここで記載される。Unless otherwise defined, technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Although any methods, devices, and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, specific methods, devices, and materials are described herein.

本出願において引用されたすべての出版物、公開された特許文書、及び特許出願は、本出願が属する技術分野（複数可）における技能レベルを表す。本明細書におけて引用されたすべての出版物、公開された特許文書、及び特許出願は、あたかもの各々の個別の出版物、公開された特許文書、または特許出願が参照によって援用されることが具体的且つ個別に指示されたかのように、それと同じ程度に参照によって本明細書に援用される。All publications, published patent documents, and patent applications cited in this application are indicative of the level of skill in the technical field(s) to which this application pertains. All publications, published patent documents, and patent applications cited herein are hereby incorporated by reference to the same extent as if each individual publication, published patent document, or patent application was specifically and individually indicated to be incorporated by reference.

添付の特許請求の範囲を含め、本出願において使用される時、単数形の「ａ」、「ａｎ」、及び「ｔｈｅ」は、別段内容が明確に指示しない限り、複数の参照を包含し、「少なくとも１つ」及び「１つまたは複数」と互換的に使用される。したがって、「ＳＯＭＡｍｅｒ」への参照はＳＯＭＡｍｅｒの混合物を包含し、「プローブ」への参照はプローブの混合物を包含する、などである。As used in this application, including the appended claims, the singular forms "a," "an," and "the" include plural references unless the content clearly dictates otherwise, and are used interchangeably with "at least one" and "one or more." Thus, a reference to a "SOMAmer" includes a mixture of SOMAmers, a reference to a "probe" includes a mixture of probes, etc.

本明細書において使用される時、「約」という用語は、数値が関連する項目の基本機能が不変であるような、数値の有意でない修飾またはバリエーションを表わす。As used herein, the term "about" refers to an insignificant modification or variation of a numerical value such that the basic function of the item to which the numerical value relates remains unchanged.

本明細書において使用される時、「含む」、「含むこと」、「包含する」、「包含すること」、「含有する」、「含有すること」という用語、及びその任意の変化形は、非排他的な包含をカバーすることが意図され、その結果、要素、または要素のリストを含むか、包含するか、または含有するプロセス、方法、プロダクト・バイ・ブロセス、または物質の組成は、それらの要素を包含するだけではく、明示的にリストされない他の要素、またはかかるプロセス、方法、プロダクト・バイ・ブロセス、もしくは物質の組成に固有も他の要素を包含し得る。As used herein, the terms "comprise," "including," "including," "comprising," "containing," "containing," and any variations thereof are intended to cover a non-exclusive inclusion, such that a process, method, product-by-process, or composition of matter that includes, includes, or contains elements, or a list of elements, does not include only those elements, but may also include other elements not expressly listed or inherent in such process, method, product-by-process, or composition of matter.

本出願は、個体がこれまでにタバコ使用者であった確率を決定するためのバイオマーカー、方法、デバイス、試薬、システム、及びキットを包含する。This application encompasses biomarkers, methods, devices, reagents, systems, and kits for determining the probability that an individual is a former tobacco user.

「タバコ使用」は、本明細書において使用される時、燃焼式タバコ及び噛みタバコを包含する。「タバコ使用」は、本明細書において使用される時、電子タバコを包含しない。"Tobacco use," as used herein, includes combustible tobacco and chewing tobacco. "Tobacco use," as used herein, does not include e-cigarettes.

「生物学的サンプル」、「サンプル」、及び「検査サンプル」は、個体から得られたか、またはさもなければ個体に由来する任意の材料、生物学的液体、組織、または細胞を指すように、本明細書において互換的に使用される。当該用語は、血液（全血液、白血球、末梢血単核細胞、バフィーコート、血漿、及び血清が挙げられる）、乾燥血液スポット（例えば乳仔から得られた）、喀痰、涙液、粘液、鼻洗浄液、鼻吸引液、呼気、尿、精液、唾液、腹腔洗浄液、腹水、嚢胞液、髄膜液、羊水、腺液、膵液、リンパ液、胸水、乳頭吸引液、気管支吸引液、気管支擦過ブラシ、滑液、関節吸引液、器官分泌物、細胞、細胞抽出物、及び脳脊髄液を包含する。当該用語は、前述のもののすべての実験的に分離された画分も包含する。例えば、血液サンプルは、血清、血漿、または赤血球もしくは白血球（ｗｈｉｔｅｂｌｏｏｄｃｅｌｌ）（白血球（ｌｅｕｋｏｃｙｔｅ））等の特定のタイプの血液細胞を含有する画分へと分画され得る。所望されるならば、サンプルは、個体からのサンプルの組み合わせ（組織サンプル及び液体のサンプルの組み合わせ等）であり得る。「生物学的サンプル」という用語は、ホモジナイズされた固体材料（例えば便サンプル、組織サンプル、または組織生検から等）を含有する材料も包含する。「生物学的サンプル」という用語は、組織培養または細胞培養に由来する材料も包含する。任意の生物学的サンプルを得るための好適な方法が用いられ得る。例示的な方法としては、例えば静脈切開、スワブ（例えばバッカルスワブ）、及び微細針吸引生検手順が挙げられる。微細針吸引を受けやすい例示的な組織としては、リンパ節、肺、肺洗浄液、ＢＡＬ（気管支肺胞洗浄液）、甲状腺、乳房、膵臓、及び肝臓が挙げられる。サンプルは、例えばマイクロダイセクション（例えばレーザーキャプチャーマイクロダイセクション（ＬＣＭ）またはレーザーマイクロダイセクション（ＬＭＤ））、膀胱洗浄、スメア（例えばＰＡＰスメア）、または乳管洗浄によっても収集され得る。個体から得られたか、または個体に由来する「生物学的サンプル」は、個体から得られた後に任意の好適な様式でプロセシングされた任意のかかるサンプルを包含する。"Biological sample," "sample," and "test sample" are used interchangeably herein to refer to any material, biological fluid, tissue, or cell obtained from or otherwise derived from an individual. The terms include blood (including whole blood, leukocytes, peripheral blood mononuclear cells, buffy coat, plasma, and serum), dried blood spots (e.g., from an infant), sputum, tears, mucus, nasal washes, nasal aspirates, exhaled breath, urine, semen, saliva, peritoneal washings, ascites, cyst fluid, cerebrospinal fluid, amniotic fluid, glandular fluid, pancreatic juice, lymphatic fluid, pleural effusion, nipple aspirate, bronchial aspirate, bronchial brush, synovial fluid, joint aspirate, organ secretions, cells, cell extracts, and cerebrospinal fluid. The terms also include experimentally isolated fractions of all of the foregoing. For example, a blood sample can be fractionated into serum, plasma, or fractions containing specific types of blood cells, such as red blood cells or white blood cells (leukocytes). If desired, a sample can be a combination of samples from an individual, such as a combination of a tissue sample and a fluid sample. The term "biological sample" also encompasses materials containing homogenized solid material (e.g., from a stool sample, tissue sample, or tissue biopsy). The term "biological sample" also encompasses materials derived from tissue or cell culture. Any suitable method for obtaining a biological sample can be used. Exemplary methods include, for example, phlebotomy, swabs (e.g., buccal swabs), and fine needle aspiration biopsy procedures. Exemplary tissues amenable to fine needle aspiration include lymph nodes, lung, lung lavage fluid, BAL (bronchoalveolar lavage fluid), thyroid, breast, pancreas, and liver. Samples may also be collected, for example, by microdissection (e.g., laser capture microdissection (LCM) or laser microdissection (LMD)), bladder washing, smear (e.g., PAP smear), or breast ductal lavage. A "biological sample" obtained from or derived from an individual includes any such sample that has been processed in any suitable manner after being obtained from the individual.

本明細書の目的のために、「個体からの生物学的サンプルに帰属するデータ」という語句は、個体の生物学的サンプルに由来するか、または、個体の生物学的サンプルを使用して生成された、いくつかの形態におけるデータを意味することが意図される。データは、生成された後に、再フォーマットされ得るか、修正され得るか、またはある測定システムでの単位から別の測定システムでの単位への変換等によってある程度まで数学的に変更され得るが、データは、生物学的サンプルに由来するか、または生物学的サンプルを使用して生成されたと理解される。For purposes of this specification, the phrase "data attributable to a biological sample from an individual" is intended to mean data in some form that is derived from or generated using an individual's biological sample. Although the data may be reformatted, modified, or mathematically altered to some extent after it is generated, such as by converting units from one measurement system to another, the data is understood to be derived from or generated using a biological sample.

「標的」、「標的分子」、及び「分析物」は、生物学的サンプル中に存在し得る関心のある任意の分子を指すように本明細書において互換的に使用される。「関心のある分子」は、特定の分子の任意の軽微なバリエーション（タンパク質の事例において、例えばアミノ酸配列におけるわずかなバリエーション、ジスルフィド結合形成、糖鎖付加、脂質付加、アセチル化、リン酸化等）、または分子の同一性を実質的に変更しない他のマニピュレーションもしくは修飾（標識構成要素によるコンジュゲーション等）を包含する。「標的分子」、「標的」、または「分析物」は、１つのタイプまたは種の分子または多分子構造のセットのコピーである。「複数の標的分子」、「複数の標的」、及び「複数の分析物」は、１を超えるかかる分子のセットを指す。例示的な標的分子としては、タンパク質、ポリペプチド、核酸、炭水化物、脂質、多糖、糖タンパク質、ホルモン、受容体、抗原、抗体、アフィボディ、抗体模倣体、ウイルス、病原体、毒性物質、基質、代謝物質、遷移状態類似体、コファクター、阻害物質、薬物、色素、栄養素、増殖因子、細胞、組織、及び上記のうちの任意のものの断片または部分が挙げられる。いくつかの実施形態において、標的分子はタンパク質であり、その事例において、標的分子は「標的タンパク質」と称され得る。"Target," "target molecule," and "analyte" are used interchangeably herein to refer to any molecule of interest that may be present in a biological sample. A "molecule of interest" encompasses any minor variations of a particular molecule (e.g., in the case of proteins, slight variations in amino acid sequence, disulfide bond formation, glycosylation, lipidation, acetylation, phosphorylation, etc.) or other manipulations or modifications (e.g., conjugation with a labeling component) that do not substantially alter the identity of the molecule. A "target molecule," "target," or "analyte" is a set of copies of one type or species of molecule or multimolecular structure. "Multiple target molecules," "multiple targets," and "multiple analytes" refer to a set of more than one such molecule. Exemplary target molecules include proteins, polypeptides, nucleic acids, carbohydrates, lipids, polysaccharides, glycoproteins, hormones, receptors, antigens, antibodies, affibodies, antibody mimetics, viruses, pathogens, toxic substances, substrates, metabolites, transition state analogs, cofactors, inhibitors, drugs, dyes, nutrients, growth factors, cells, tissues, and fragments or portions of any of the above. In some embodiments, the target molecule is a protein, in which case the target molecule may be referred to as a "target protein."

本明細書において使用される時、「捕捉剤」または「捕捉試薬」は、バイオマーカーへ特異的に結合することが可能な分子を指す。「標的タンパク質捕捉試薬」は、標的タンパク質へ特異的に結合することが可能な分子を指す。非限定的な例示的な捕捉試薬としては、アプタマー、抗体、アドネクチン、アンキリン、他の抗体模倣体及び他のタンパク質スキャフォールド、自己抗体、キメラ、小分子、核酸、レクチン、リガンド結合受容体、インプリントポリマー、アビマー、ペプチド模倣体、ホルモン受容体、サイトカイン受容体、合成受容体、ならびに前述の捕捉試薬のうちの任意のものの修飾物及び断片が挙げられる。いくつかの実施形態において、捕捉試薬は、アプタマー及び抗体から選択される。As used herein, "capture agent" or "capture reagent" refers to a molecule capable of specifically binding to a biomarker. "Target protein capture reagent" refers to a molecule capable of specifically binding to a target protein. Non-limiting exemplary capture reagents include aptamers, antibodies, adnectins, ankyrins, other antibody mimetics and other protein scaffolds, autoantibodies, chimeras, small molecules, nucleic acids, lectins, ligand-binding receptors, imprinted polymers, avimers, peptidomimetics, hormone receptors, cytokine receptors, synthetic receptors, and modifications and fragments of any of the foregoing capture reagents. In some embodiments, the capture reagent is selected from an aptamer and an antibody.

本明細書において使用される時、「ポリペプチド」、「ペプチド」、及び「タンパク質」は、任意の長さのアミノ酸のポリマーを指すように本明細書において互換的に使用される。ポリマーは、直鎖状または分岐状であり得、修飾アミノ酸を含み得、非アミノ酸によって中断され得る。当該用語は、また天然にまたは介入によって修飾されたアミノ酸ポリマーを網羅し、例えばジスルフィド結合形成、糖鎖付加、脂質付加、アセチル化、リン酸化、または他のマニピュレーションもしくは修飾（標識構成要素によるコンジュゲーション等）である。例えば１つまたは複数のアミノ酸の類似体（例えば非天然アミノ酸などが挙げられる）そして当技術分野において公知の他の修飾を含有するポリペプチドも定義内に包含される。ポリペプチドは、単鎖または会合された鎖であり得る。プレタンパク質及びインタクトな成熟タンパク質；成熟タンパク質に由来するペプチドまたはポリペプチド；タンパク質の断片；スプライスバリアント；タンパク質の組み換え形態；アミノ酸の修飾、欠失、または置換のあるタンパク質バリアント；消化物；及び翻訳後修飾（糖鎖付加、アセチル化、リン酸化、及び同種のもの等）も定義内に包含される。As used herein, the terms "polypeptide," "peptide," and "protein" are used interchangeably to refer to polymers of amino acids of any length. Polymers can be linear or branched, can contain modified amino acids, and can be interrupted by non-amino acids. The terms also encompass amino acid polymers that are modified naturally or by intervention, such as disulfide bond formation, glycosylation, lipidation, acetylation, phosphorylation, or other manipulation or modification (such as conjugation with a labeling component). Also included within the definition are polypeptides containing one or more analogs of an amino acid (including, for example, unnatural amino acids) and other modifications known in the art. Polypeptides can be single chains or associated chains. Also included within the definition are preproteins and intact mature proteins; peptides or polypeptides derived from mature proteins; fragments of proteins; splice variants; recombinant forms of proteins; protein variants with amino acid modifications, deletions, or substitutions; digests; and post-translational modifications (such as glycosylation, acetylation, phosphorylation, and the like).

「抗体」という用語は、任意の種の全長抗体、ならびにかかる抗体の断片及び誘導体（Ｆａｂ断片及びＦ（ａｂ’）_２断片、一本鎖抗体、Ｆｖ断片、及び一本鎖Ｆｖ断片が挙げられる）を指す。「抗体」という用語は、合成に由来する抗体（ファージディスプレイ由来抗体及び断片、アフィボディ、ナノボディ等）も指す。 The term "antibody" refers to full-length antibodies of any species, as well as fragments and derivatives of such antibodies, including Fab and F(ab')₂ fragments, single chain antibodies, Fv fragments, and single chain Fv fragments. The term "antibody" also refers to synthetically derived antibodies, such as phage display-derived antibodies and fragments, affibodies, nanobodies, etc.

本明細書において使用される時、「マーカー」及び「バイオマーカー」及び「フィーチャ」は、個体における正常もしくは異常なプロセスまたは個体における疾患もしくは他の病態を示すか、あるいはそれらの徴候である標的分子を指すように互換的に使用される。より具体的には、「マーカー」または「バイオマーカー」または「フィーチャ」は、正常または異常にかかわらず、そして異常ならば、慣性または急性にかかわらず、特定の生理的な状態またはプロセスの存在と関連する、解剖学的、生理学的、生化学的、または分子的なパラメーターである。バイオマーカーは、様々な方法（検査室アッセイ及び医療用イメージングが挙げられる）によって検出可能及び測定可能である。バイオマーカーがタンパク質である場合に、生物学的サンプル中の対応するタンパク質バイオマーカーの量または存在もしくは非存在の代替的測定値として対応する遺伝子の発現、あるいはバイオマーカーまたはバイオマーカーの発現を制御するタンパク質をコードする遺伝子のメチル化状態を使用することも可能である。ある特定の態様において、フィーチャは、統計モデルにおける他の予測因子である分析物／ＳＯＭＡｍｅｒ試薬である。As used herein, "marker," "biomarker," and "feature" are used interchangeably to refer to a target molecule that is indicative of or symptomatic of a normal or abnormal process in an individual, or a disease or other pathological condition in an individual. More specifically, a "marker," "biomarker," or "feature" is an anatomical, physiological, biochemical, or molecular parameter that correlates with the presence of a particular physiological state or process, whether normal or abnormal, and if abnormal, whether inert or acute. Biomarkers are detectable and measurable by a variety of methods, including laboratory assays and medical imaging. When a biomarker is a protein, it is also possible to use the expression of the corresponding gene or the methylation status of the gene encoding the biomarker or the protein that controls the expression of the biomarker as a surrogate measure of the amount or presence or absence of the corresponding protein biomarker in a biological sample. In certain embodiments, a feature is an analyte/SOMAmer reagent that is another predictor in a statistical model.

本明細書において使用される時、「バイオマーカー値」、「値」、「バイオマーカーレベル」、「フィーチャレベル」、及び「レベル」は、生物学的サンプル中のバイオマーカーの検出のための任意の分析方法を使用して行われ、生物学的サンプル中のバイオマーカーの、当該バイオマーカーについての、または当該バイオマーカーに対応する、存在、非存在、絶対量もしくは濃度、相対量もしくは濃度、力価、レベル、発現レベル、測定されたレベルの比率、または同種のものを示す測定値を指すように互換的に使用される。「値」または「レベル」の正確な性質は、バイオマーカーを検出するために用いられる特定の分析方法の具体的なデザイン及び構成要素に依存する。As used herein, "biomarker value," "value," "biomarker level," "feature level," and "level" are used interchangeably to refer to a measurement of a biomarker in a biological sample, performed using any analytical method for the detection of a biomarker in the biological sample, that indicates the presence, absence, absolute amount or concentration, relative amount or concentration, titer, level, expression level, ratio of measured levels, or the like, of, or corresponding to, that biomarker. The exact nature of the "value" or "level" will depend on the specific design and components of the particular analytical method used to detect the biomarker.

バイオマーカーが、個体における異常なプロセスまたは疾患もしくは他の病態を示すか、あるいはそれらの徴候である場合に、そのバイオマーカーは、概して、個体における正常なプロセスまたは疾患もしくは他の病態の非存在を示すか、あるいはそれらの徴候であるバイオマーカーの発現レベルまたは値に比較して、過剰発現または過少発現されるのいずれかとして記載される。「アップレギュレーション」、「アップレギュレートされた」、「過剰発現」、「過剰発現された」、及び任意のそれらの変化形は、健康または正常な個体からの類似する生物学的サンプルにおいて典型的には検出されるバイオマーカーの値またはレベル（または値もしくはレベルの範囲）を超える、生物学的サンプル中のバイオマーカーの値またはレベルを指すように互換的に使用される。当該用語は、特定の疾患の異なるステージで検出され得るバイオマーカーの値またはレベル（または値もしくはレベルの範囲）を超える、生物学的サンプル中のバイオマーカーの値またはレベルも指し得る。When a biomarker is indicative of or symptomatic of an abnormal process or disease or other condition in an individual, the biomarker is generally described as being either overexpressed or underexpressed relative to an expression level or value of the biomarker that is indicative of or symptomatic of the absence of a normal process or disease or other condition in the individual. "Upregulation," "upregulated," "overexpression," "overexpressed," and any variations thereof are used interchangeably to refer to a value or level of the biomarker in a biological sample that exceeds the value or level (or range of values or levels) of the biomarker typically detected in a similar biological sample from a healthy or normal individual. The term may also refer to a value or level of the biomarker in a biological sample that exceeds the value or level (or range of values or levels) that can be detected at different stages of a particular disease.

「ダウンレギュレーション」、「ダウンレギュレートされた」、「過少発現」、「過少発現された」、及び任意のそれらの変化形は、健康または正常な個体からの類似する生物学的サンプルにおいて典型的には検出されるバイオマーカーの値またはレベル（または値もしくはレベルの範囲）未満である、生物学的サンプル中のバイオマーカーの値またはレベルを指すように互換的に使用される。当該用語は、特定の疾患の異なるステージで検出され得るバイオマーカーの値またはレベル（または値もしくはレベルの範囲）未満である、生物学的サンプル中のバイオマーカーの値またはレベルも指し得る。"Downregulation," "downregulated," "underexpression," "underexpressed," and any variations thereof, are used interchangeably to refer to a value or level of a biomarker in a biological sample that is less than the value or level (or range of values or levels) of the biomarker typically detected in a similar biological sample from a healthy or normal individual. The terms may also refer to a value or level of a biomarker in a biological sample that is less than the value or level (or range of values or levels) of the biomarker that may be detected at different stages of a particular disease.

さらに、過剰発現されるかまたは過少発現されるのいずれかであるバイオマーカーは、個体における正常なプロセスまたは疾患もしくは他の病態の非存在を示すか、あるいはそれらの徴候であるバイオマーカーの「正常な」発現レベルまたは値に比較して、「差次的に発現される」または「差次的なレベル」もしくは「差次的な値」を有するとも称され得る。したがって、バイオマーカーの「差次的発現」は、バイオマーカーの「正常な」発現レベルからの変動とも称され得る。Furthermore, a biomarker that is either overexpressed or underexpressed may also be referred to as being "differentially expressed" or having a "differential level" or "differential value" compared to a "normal" expression level or value of the biomarker that is indicative of, or symptomatic of, a normal process or the absence of a disease or other pathological condition in an individual. Thus, "differential expression" of a biomarker may also be referred to as a variation from the "normal" expression level of the biomarker.

「差次的遺伝子発現」及び「差次的発現」という用語は、特定の疾患または病態に罹患する対象において、その発現が、正常または対照の対象におけるその発現に比べて、より高いかまたはより低いレベルへ活性化される遺伝子（またはその対応するタンパク質発現産物）を指すように互換的に使用される。当該用語は、その発現が同じ疾患または病態の異なるステージでより高いかまたはより低いレベルへ活性化される遺伝子（または対応するタンパク質発現産物）も包含する。差次的に発現される遺伝子は、核酸レベルもしくはタンパク質レベルで活性化もしくは阻害され得るか、または異なるポリペプチド産物をもたらすようにオルタナティブスプライシングを受け得るかのいずれかであることも理解されたい。かかる差は、様々な変化（ｍＲＮＡのレベル、表面発現、分泌、またはポリペプチドの他の分配を包含する）によって証明され得る。差次的遺伝子発現は、２つ以上の遺伝子もしくはそれらの遺伝子産物の間の発現の比較；または２つ以上の遺伝子もしくはそれらの遺伝子産物の間の発現の比の比較；またはさらに同じ遺伝子の異なってプロセシングされた２つの産物（それらは正常な対象と疾患に罹患する対象との間で；または同じ疾患の様々なステージ間で異なる）の比較を包含し得る。差次的発現は、例えば正常細胞及び疾患細胞の中での、または異なる疾患事象もしくは疾患ステージを経た細胞の中での、遺伝子またはその発現産物の時間的発現パターンまたは細胞発現パターンにおける定量的差そして定性的差の両方を包含する。The terms "differential gene expression" and "differential expression" are used interchangeably to refer to a gene (or its corresponding protein expression product) whose expression is activated to higher or lower levels in a subject afflicted with a particular disease or condition compared to its expression in a normal or control subject. The terms also encompass genes (or their corresponding protein expression products) whose expression is activated to higher or lower levels at different stages of the same disease or condition. It should also be understood that a differentially expressed gene may be activated or inhibited at the nucleic acid or protein level, or may undergo alternative splicing to result in a different polypeptide product. Such differences may be evidenced by a variety of changes, including mRNA levels, surface expression, secretion, or other distribution of polypeptides. Differential gene expression may encompass a comparison of expression between two or more genes or their gene products; or a comparison of the ratio of expression between two or more genes or their gene products; or even a comparison of two differentially processed products of the same gene that differ between normal and diseased subjects; or between various stages of the same disease. Differential expression encompasses both quantitative and qualitative differences in the temporal or cellular expression patterns of a gene or its expression products, for example, between normal and diseased cells, or between cells that have undergone different disease events or stages.

標的分子の「対照レベル」は、同じサンプルタイプの適切に取り扱われたサンプル中の標的分子のレベルを指す。対照レベルは、個体の集団からの適切に取り扱われたサンプル中の標的分子の平均レベルを指し得る。A "control level" of a target molecule refers to the level of the target molecule in properly handled samples of the same sample type. A control level may refer to the average level of the target molecule in properly handled samples from a population of individuals.

本明細書において使用される時、「個体」は、検査対象または患者を指す。個体は、哺乳動物または非哺乳動物であり得る。様々な実施形態において、個体は哺乳動物である。哺乳動物個体は、ヒトまたは非ヒトであり得る。様々な実施形態において、個体はヒトである。As used herein, "individual" refers to a test subject or patient. An individual may be a mammal or a non-mammal. In various embodiments, an individual is a mammal. A mammalian individual may be a human or a non-human. In various embodiments, an individual is a human.

「診断する」、「診断すること」、「診断」、及びそれらの変化形は、１つまたは複数の徴候、症状、データ、またはその個体に属する他の情報に基づいて、個体の健康状態または病態を検出、決定、または認識することを指す。個体の健康状態は、健康／正常と診断され得る（すなわち疾患または病態の非存在の診断）、または病気／異常と診断され得る（すなわち疾患もしくは病態の存在の診断、または疾患もしくは病態の特徴の査定）。「診断する」、「診断すること」、「診断」などという用語は、特定の疾患または病態に関して、疾患の初期検出；疾患の特徴評価または分類；疾患の進行、寛解、または再発の検出；及び治療または治療法の個体への投与後の疾患応答の検出を網羅する。これまでにタバコ使用者であった確率は、１度もタバコ使用者ではなかった個体から、これまでにタバコ使用者（以前または現在）であった個体を区別することを包含する。"Diagnosing," "diagnosing," "diagnosis," and variations thereof refer to detecting, determining, or recognizing an individual's health or condition based on one or more signs, symptoms, data, or other information pertaining to that individual. An individual's health may be diagnosed as healthy/normal (i.e., diagnosing the absence of a disease or condition) or as diseased/abnormal (i.e., diagnosing the presence of a disease or condition, or assessing the characteristics of a disease or condition). The terms "diagnosing," "diagnosing," "diagnosis," and the like, with respect to a particular disease or condition, encompass the initial detection of the disease; the characterization or classification of the disease; the detection of disease progression, remission, or recurrence; and the detection of disease response following administration of a treatment or therapy to an individual. The probability of ever being a tobacco user encompasses distinguishing individuals who were ever tobacco users (former or current) from individuals who were never tobacco users.

「予後予測する」、「予後予測すること」、「予後予測」、及びそれらの変化形は、疾患または病態を有する個体における疾患または病態の将来の経過の予測（例えば予測する患者の生存）を指し、かかる用語は、治療または治療法の個体への投与後の疾患または病態の応答の評価を網羅する。"Prognose," "prognosing," "prognosis," and variations thereof refer to predicting the future course of a disease or condition in an individual having the disease or condition (e.g., predicting patient survival), and such terms encompass the assessment of the response of a disease or condition following administration of a treatment or therapy to an individual.

「評価する」、「評価すること」、「評価」、及びそれらの変化形は、「診断する」及び「予後予測する」の両方を網羅し、疾患を有していない個体における疾患または病態の将来の経過についての決定または予測、そして、明らかに疾患が治癒したかまたは病態の解消を有する個体において、その疾患または病態が再発するリスクに関する決定または予測も網羅する。「評価する」という用語は、治療法への個体の応答を査定すること（例えば個体が治療剤へ順調に応答しそうかもしくは治療剤へ応答しそうにないか（または例えば毒性作用もしくは他の所望されない副作用を経験するだろうか）を予測すること、個体への投与のために治療剤を選択すること、または個体へ投与された治療法への個体の応答をモニタリングもしくは決定すること等）も網羅する。"Evaluate," "assessing," "evaluation," and variations thereof encompass both "diagnosing" and "prognosing," and also encompass the determination or prediction of the future course of a disease or condition in individuals who do not have the disease, and the determination or prediction of the risk of recurrence of the disease or condition in individuals who have apparently been cured of the disease or have resolved the condition. The term "evaluating" also encompasses assessing an individual's response to a therapy (e.g., predicting whether an individual is likely to respond favorably or unlikely to respond to a therapeutic agent (or, for example, will experience toxic or other undesirable side effects), selecting a therapeutic agent for administration to an individual, or monitoring or determining an individual's response to a therapy administered to an individual, etc.).

本明細書において使用される時、「追加の生物医学的情報」は、タバコ使用と関連する本明細書において記載されるバイオマーカーのうちの任意のものを使用すること以外の、個体の１つまたは複数の評価を指す。「追加の生物医学的情報」としては、以下の：個体の身長及び／または体重を包含する個体の身体的な記述子；個体の年齢；個体の性別；体重の変化；個体の民族性；職業歴；タバコ使用の家族歴；遺伝的マーカー（複数可）の存在；臨床症状（腹痛、体重増減、遺伝子発現値等）；個体の身体的な記述子（放射線イメージングによって観察された身体的な記述子を包含する）；タバコ使用状態；アルコール使用の履歴；職業歴；食習慣（塩、飽和脂肪、及びコレステロールの摂取）；カフェイン消費量；及びイメージング情報のうちの任意のものが挙げられる。追加の生物医学的情報は、当技術分野において公知のルーチンの技法を使用して、個体から（ルーチンの患者質問票または健康歴質問票などの使用によって個体自体から、または医療従事者からなど）得られ得る。As used herein, "additional biomedical information" refers to one or more assessments of an individual other than using any of the biomarkers described herein associated with tobacco use. "Additional biomedical information" includes any of the following: physical descriptors of the individual, including the individual's height and/or weight; the individual's age; the individual's sex; weight change; the individual's ethnicity; occupational history; family history of tobacco use; presence of genetic marker(s); clinical symptoms (abdominal pain, weight gain or loss, gene expression levels, etc.); physical descriptors of the individual (including physical descriptors observed by radiological imaging); tobacco use status; history of alcohol use; occupational history; dietary habits (salt, saturated fat, and cholesterol intake); caffeine consumption; and imaging information. Additional biomedical information can be obtained from the individual (e.g., from the individual themselves, such as through the use of routine patient questionnaires or health history questionnaires, or from a healthcare professional) using routine techniques known in the art.

本明細書において使用される時、バイオマーカー値に関して「検出すること」または「決定すること」は、バイオマーカー値に対応するシグナルを観察及び記録するのに要求される装置ならびにそのシグナルを生成するのに要求される材料（複数可）の使用の両方を包含する。様々な実施形態において、バイオマーカー値は、任意の好適な方法を使用して検出され、当該方法としては、蛍光、化学発光、表面プラズモン共鳴、表面弾性波、質量分析法、赤外分光法、ラマン分光法、原子間力顕微鏡、走査トンネル顕微鏡、電気化学検出法、核磁気共鳴、量子ドット、及び同種のものが挙げられる。As used herein, "detecting" or "determining" with respect to a biomarker value encompasses both the use of the equipment required to observe and record the signal corresponding to the biomarker value as well as the material(s) required to generate that signal. In various embodiments, the biomarker value is detected using any suitable method, including fluorescence, chemiluminescence, surface plasmon resonance, surface acoustic waves, mass spectrometry, infrared spectroscopy, Raman spectroscopy, atomic force microscopy, scanning tunneling microscopy, electrochemical detection, nuclear magnetic resonance, quantum dots, and the like.

「固体支持体」は、分子が、共有結合または非共有結合のいずれかを介して直接的または間接的に結合され得る表面を有する任意の基材を本明細書において指す。「固体支持体」は様々な物理フォーマットを有し得、当該物理フォーマットとしては、例えば膜；チップ（例えばタンパク質チップ）；スライド（例えばスライドグラスまたはカバースリップ）；カラム；中空の、固体の、半固体の、孔含有の、または空洞含有の粒子（例えばビーズ等）；ゲル；ファイバー（光ファイバー材料が挙げられる）；マトリックス；及びサンプル容器が挙げられ得る。例示的なサンプル容器としては、サンプルウェル、チューブ、キャピラリー、バイアル、及び他の容器、サンプルを維持することができる溝または窪みが挙げられる。サンプル容器は、マルチサンプルプラットフォーム（マイクロタイタープレート、スライド、マイクロ流体デバイス、及び同種のもの等）に含有され得る。支持体は、天然材料または合成材料、有機材料または無機材料から構成され得る。捕捉試薬が結合される固体支持体の組成は、概して、結合の方法（例えば共有結）に依存する。他の例示的な容器としては、アッセイ及び関連するマニピュレーションが起こり得る、微小液滴、マイクロ流体制御されたエマルション、またはバルクの油／水エマルションが挙げられる。好適な固体支持体としては、例えばプラスチック、樹脂、多糖、シリカまたはシリカベースの材料、官能化ガラス、変性ケイ素、炭素、金属、無機ガラス、膜、ナイロン、天然繊維（例えば絹、羊毛、及びワタ等）、ポリマー、及び同種のものが挙げられる。固体支持体を構成する材料は、捕捉試薬の結合のために使用される反応性基（例えばカルボキシ基、アミノ基、またはヒドロキシル基等）を含み得る。ポリマー固体支持体としては、例えばポリスチレン、ポリエチレングリコールテトラフタレート、ポリ酢酸ビニル、ポリ塩化ビニル、ポリビニルピロリドン、ポリアクリロニトリル、ポリメチルメタクリレート、ポリテトラフルオロエチレン、ブチルゴム、スチレンブタジエンゴム、天然ゴム、ポリエチレン、ポリプロピレン、（ポリ）テトラフルオロエチレン、（ポリ）フッ化ビニリデン、ポリカーボネート、及びポリメチルペンテンが挙げられ得る。使用され得る好適な固体支持体粒子としては、例えばコード化粒子（Ｌｕｍｉｎｅｘ（登録商標）タイプのコード化粒子等）、磁性粒子、及びガラス微粒子が挙げられる。"Solid support" refers herein to any substrate having a surface to which molecules can be directly or indirectly attached, either covalently or non-covalently. A "solid support" can have a variety of physical formats, including, for example, membranes; chips (e.g., protein chips); slides (e.g., glass slides or coverslips); columns; hollow, solid, semi-solid, hole-containing, or cavity-containing particles (e.g., beads); gels; fibers (including fiber optic materials); matrices; and sample vessels. Exemplary sample vessels include sample wells, tubes, capillaries, vials, and other vessels, grooves, or depressions capable of holding a sample. Sample vessels can be contained in multi-sample platforms (e.g., microtiter plates, slides, microfluidic devices, and the like). Supports can be composed of natural or synthetic, organic, or inorganic materials. The composition of the solid support to which the capture reagent is attached generally depends on the method of attachment (e.g., covalent). Other exemplary vessels include microdroplets, microfluidically controlled emulsions, or bulk oil/water emulsions in which assays and related manipulations can occur. Suitable solid supports include, for example, plastics, resins, polysaccharides, silica or silica-based materials, functionalized glass, modified silicon, carbon, metals, inorganic glass, membranes, nylon, natural fibers (e.g., silk, wool, cotton, etc.), polymers, and the like. The material comprising the solid support may contain reactive groups (e.g., carboxyl, amino, or hydroxyl groups) used for binding capture reagents. Polymeric solid supports may include, for example, polystyrene, polyethylene glycol tetraphthalate, polyvinyl acetate, polyvinyl chloride, polyvinylpyrrolidone, polyacrylonitrile, polymethyl methacrylate, polytetrafluoroethylene, butyl rubber, styrene butadiene rubber, natural rubber, polyethylene, polypropylene, (poly)tetrafluoroethylene, (poly)vinylidene fluoride, polycarbonate, and polymethylpentene. Suitable solid support particles that can be used include, for example, encoded particles (such as Luminex®-type encoded particles), magnetic particles, and glass microparticles.

本明細書において使用される時、「分析物」は、捕捉試薬のタンパク質標的である。ある特定の態様において、捕捉試薬はアプタマーである。ある特定のさらなる態様において、捕捉試薬はＳＯＭＡｍｅｒである。As used herein, an "analyte" is the protein target of a capture reagent. In certain embodiments, the capture reagent is an aptamer. In certain further embodiments, the capture reagent is a SOMAmer.

本明細書において使用される時、「ＬｉｎのＣＣＣ」は、新たな検査とゴールドスタンダードと判断される既存の検査との間の一致を測定する一致相関係数を意味する。As used herein, "Lin's CCC" refers to the concordance correlation coefficient, which measures the agreement between a new test and an existing test considered the gold standard.

本明細書において使用される時、「試験」は、検査を導くために分析されるサンプルのセット及び臨床データを意味する。As used herein, "test" means a set of samples and clinical data that are analyzed to derive a test.

本明細書において使用される時、「訓練データセット」は、モデルに適合させるために使用される、試験からのデータのサブセットを意味する。As used herein, "training dataset" means a subset of data from a study that is used to fit a model.

本明細書において使用される時、「妥当性確認データセット」は、検証データセット上で開発された選択したモデルの性能を査定するために使用されるデータの最終的なサブセットを意味する。As used herein, "validation dataset" refers to the final subset of data used to assess the performance of the selected model developed on the validation dataset.

本明細書において使用される時、「検証データセット」は、モデルパラメーターを調整しながら、訓練データセットに適合させたモデルの偏りがない評価を提供するために使用された、データの分離したサブセットを意味する。As used herein, "validation dataset" means a separate subset of data used to provide an unbiased evaluation of a model fitted to the training dataset while adjusting model parameters.

本明細書において使用される時、「必要とする」または「必要とされる」という用語は、ヘルスケア提供者によって患者の健康状態に有益であると判断される患者の治療に関してヘルスケア提供者によって下された判断を指す。As used herein, the terms "need" or "required" refer to a judgment made by a healthcare provider regarding treatment of a patient that is determined by the healthcare provider to be beneficial to the patient's health condition.

リスク分析
いくつかの態様において、個体がこれまでにタバコ使用者であった確率を予測するための客観的検査が本明細書において開示される。 Risk Analysis In some embodiments, disclosed herein are objective tests for predicting the probability that an individual is a former tobacco user.

リスク分析プロファイルは、表１中でのように記載され得る。
The risk analysis profile can be described as in Table 1.

本明細書において開示される検査方法は、主観的な自己申告、誤解を招く自己申告、または偽造された自己申告に依存すること無しに、現在または以前の（すなわち「これまでの」）タバコ使用習慣についての知見を提供するだろう。本明細書において開示される検査方法は、タバコ使用関連病態を予防または遅延させ得るポジティブな挙動変化（すなわちタバコ使用の中断）に影響を及ぼし得る。本明細書において開示される検査方法は、ヘルスケア提供者及び患者がタバコ使用習慣、タバコ使用中断の遵守、及びプロテオームへのタバコ使用の経時的な影響をモニタリングすることを可能にし得る。本明細書において開示される検査方法は、疾患スクリーニング適格性についての個体のリスク査定のために考慮され得る。
The testing methods disclosed herein may provide insight into current or former (i.e., "ever") tobacco use habits without relying on subjective, misleading, or falsified self-reporting. The testing methods disclosed herein may influence positive behavioral changes (i.e., tobacco use cessation) that may prevent or delay tobacco use-related pathologies. The testing methods disclosed herein may enable health care providers and patients to monitor tobacco use habits, adherence to tobacco use cessation, and the effects of tobacco use on the proteome over time. The testing methods disclosed herein may be considered for individual risk assessment for disease screening eligibility.

性能閾値は、喫煙習慣を予測するための市販されている主要なエピジェネティック予測モデルであるＡＨＲＲ遺伝子のＤＮＡメチル化の性能に基づいて確立された（ＬａｎｇｄｏｎＲＪ，ｅｔａｌ．，ＣｌｉｎＥｐｉｇｅｎｅｔｉｃｓ，２０２１；１３（１）：２０６；ＰｈｉｌｉｂｅｒｔＲ，ｅｔａｌ．，ＦｒｏｎｔＧｅｎｅｔ，２０１８；９：１３７）。The performance threshold was established based on the performance of DNA methylation of the AHRR gene, the leading commercially available epigenetic prediction model for predicting smoking habits (Langdon RJ, et al., Clin Epigenetics, 2021;13(1):206; Philibert R, et al., Front Genet, 2018;9:137).

「以前の」そして「現在の」タバコ使用状態を予測する市販されている検査はない。最近の（すなわち３日未満の）タバコ使用を予測するために、コチニン（ニコチン代謝産物）は、臨床設定において一般的に使用される（ＣｅｎｔｅｒｓｆｏｒＤｉｓｅａｓｅＣｏｎｔｒｏｌａｎｄＰｒｅｖｅｎｔｉｏｎ．ＢｉｏｍｏｎｉｔｏｒｉｎｇＳｕｍｍａｒｙ：Ｃｏｔｉｎｉｎｅ．Ａｐｒｉｌ２０１７．同文献は、ｈｔｔｐｓ：／／ｗｗｗ．ｃｄｃ．ｇｏｖ／ｂｉｏｍｏｎｉｔｏｒｉｎｇ／Ｃｏｔｉｎｉｎｅ＿ＢｉｏｍｏｎｉｔｏｒｉｎｇＳｕｍｍａｒｙ．ｈｔｍｌから利用可能）。しかしながら、コチニンはタバコ煙曝露の長期的な生物学的影響を示さないので、比較可能な参照ツールではない。本明細書において開示される検査方法は、現在のエピジェネティック予測モデルの性能に少なくとも匹敵し、例えばＢｅｈａｖｉｏｒａｌＤｉａｇｎｏｓｔｉｃｓ，ＬＬＣによって提供されるＳｍｏｋｅＳｉｇｎａｔｕｒｅ（登録商標）検査は、ＡＨＲＲ遺伝子のＤＮＡメチル化についてのカットオフを使用して「現在の」喫煙者ｖｓ非喫煙者のみを層別化することを請求する（ＰｈｉｌｉｂｅｒｔＲ，ｅｔａｌ．，ＦｒｏｎｔＧｅｎｅｔ，２０１８；９：１３７；ＤａｗｅｓＫ，ｅｔａｌ．ＳｃｉＲｅｐ，２０２１；１１（１）：２１６２７）。最近、三成分の喫煙状態を分類する能力を査定するために、ＡＨＲＲメチル化を２ステージアプローチに適用し、ＡＵＣは０．７１７～０．９０２に範囲により遂行した（ＬａｎｇｄｏｎＲＪ，ｅｔａｌ．，ＣｌｉｎＥｐｉｇｅｎｅｔｉｃｓ，２０２１；１３（１）：２０６）。エピジェネティックな変化は定量的であり得るが（ＰｈｉｌｉｂｅｒｔＲ，ｅｔａｌ．，ＦｒｏｎｔＰｓｙｃｈｉａｔｒｙ，２０１６；７：５５）、これらの変化は、煙暴露に対する有意な変更を要求し、喫煙状態クラスにおける測定可能な差を示すためには、長期間（すなわち数年）かかり（ＭｃＣａｒｔｎｅｙＤＬ，ｅｔａｌ．，ＥＢｉｏＭｅｄｉｃｉｎｅ，２０１８；３７：２１４－２２０）、したがって、動的プロテオームの査定は、喫煙習慣におけるより急速な変化を予測しやすくなり得る。本明細書において開示される方法の実施形態は０．６５以上、０．６６、０．６７、０．６８、０．６９、０．７０、０．７１７超のＡＵＣにより遂行され、０．７１７は、三成分の分類について報告されたＡＨＲＲメチル化性能の下限である。There are no commercially available tests that predict "former" and "current" tobacco use status. Cotinine (a nicotine metabolite) is commonly used in clinical settings to predict recent (i.e., less than 3 days) tobacco use (Centers for Disease Control and Prevention. Biomonitoring Summary: Cotinine. April 2017. Available at https://www.cdc.gov/biomonitoring/Cotinine_BiomonitoringSummary.html). However, cotinine is not a comparable reference tool because it does not indicate the long-term biological effects of tobacco smoke exposure. The testing methods disclosed herein at least match the performance of current epigenetic prediction models; for example, the Smoke Signature® test offered by Behavioral Diagnostics, LLC claims to stratify only "current" smokers vs. non-smokers using a cutoff for DNA methylation of the AHRR gene (Philibert R, et al., Front Genet, 2018; 9:137; Dawes K, et al. Sci Rep, 2021; 11(1):21627). Recently, AHRR methylation was applied to a two-stage approach to assess its ability to classify three-component smoking status, with AUCs ranging from 0.717 to 0.902 (Langdon RJ, et al., Clin Epigenetics, 2021;13(1):206). Although epigenetic changes can be quantitative (Philibert R, et al., Front Psychiatry, 2016;7:55), these changes require significant alterations in response to smoke exposure and take long periods (i.e., years) to show measurable differences in smoking status classes (McCartney DL, et al., EBioMedicine, 2018;37:214-220). Therefore, dynamic proteomic assessment may be more predictive of more rapid changes in smoking habits. Embodiments of the methods disclosed herein perform with an AUC of 0.65 or greater, 0.66, 0.67, 0.68, 0.69, 0.70, or greater than 0.717, with 0.717 being the lower bound of reported AHRR methylation performance for the ternary classification.

いくつかの実施形態において、バイオマーカーのサブセットまたはパネルに有用なバイオマーカーの数は、バイオマーカーレベルの特定の組み合わせについての感度及び特異度の値に基づく。「感度」及び「特異度」という用語は、生物学的サンプル中で検出された１つまたは複数のバイオマーカーレベルに基づいて、これまでに喫煙したか、または１度も喫煙したことがないとして、個体を正確に分類する能力に関して本明細書において使用される。「感度」は、個体が、これまでに（現在及び以前に）喫煙していたか、または１度も喫煙したことがないと正確に分類することに関するバイオマーカー（複数可）の性能を示す。「特異度」は、喫煙していない個体を正確に分類することに関するバイオマーカー（複数可）の性能を示す。In some embodiments, the number of biomarkers useful in a biomarker subset or panel is based on the sensitivity and specificity values for a particular combination of biomarker levels. The terms "sensitivity" and "specificity" are used herein in reference to the ability to accurately classify an individual as an ever smoker or a never smoker based on one or more biomarker levels detected in a biological sample. "Sensitivity" refers to the performance of a biomarker(s) in accurately classifying an individual as an ever (current and former) smoker or a never smoker. "Specificity" refers to the performance of a biomarker(s) in accurately classifying individuals who have never smoked.

いくつかの実施形態において、１つまたは複数のバイオマーカーのパネルの全体的な性能は曲線下面積（ＡＵＣ）値によって表わされる。ＡＵＣ値は、受信者動作特性（ＲＯＣ）曲線から導出される。ＲＯＣ曲線は、検査の偽陽性率（１－特異度）に対する検査の真陽性率（感度）のプロットである。「曲線下面積」または「ＡＵＣ」という用語は、受信者動作特性（ＲＯＣ）曲線の曲線下面積を指し、その両方は当技術分野において周知である。ＡＵＣ測定は、完全なデータ範囲にわたる分類子の正確性を比較するのに有用である。より大きいＡＵＣの分類子は、関心のある２つの群（例えばこれまでにタバコ使用者（以前及び現在）と１度もタバコ使用をしたことがない者）の間で未知のものを正確に分類する、より高い能力を有する。ＲＯＣ曲線は、２つの集団の区別における特定のフィーチャ（例えば本明細書において記載されるバイオマーカーのうちの任意のもの及び／または追加の生物医学的情報の任意の項目）の性能をプロットするのに有用である。典型的には、集団にわたるフィーチャデータは、単一のフィーチャの値に基づいた昇順で選別される。次いで、そのフィーチャの各々の値について、データの真陽性率及び偽陽性率が計算される。真陽性率は、そのフィーチャについて値を超えた事例の数をカウントし、次いで、事例の総数によって除算することによって決定される。偽陽性率は、そのフィーチャについて値を超えた対照の数をカウントし、次いで、対照の総数によって除算することによって決定される。この定義は、フィーチャが対照に比較した場合に上昇するシナリオを指すが、この定義は、フィーチャが対照に比較した場合により低いシナリオへも適用される（かかるシナリオにおいて、そのフィーチャについて値未満のサンプルがカウントされるだろう）。ＲＯＣ曲線は、単一のフィーチャ、そして他の単一の出力について生成され得、例えば２つ以上のフィーチャの組み合わせは数学的に組み合わせられて（例えば、加算。減算、乗算されてなど）、単一の合計値を提供することができ、この単一の合計値は、ＲＯＣ曲線においてプロットされ得る。追加で、組み合わせが単一の出力値を導出する、複数のフィーチャの任意の組み合わせは、ＲＯＣ曲線においてプロットされ得る。In some embodiments, the overall performance of a panel of one or more biomarkers is represented by an area under the curve (AUC) value. AUC values are derived from a receiver operating characteristic (ROC) curve. An ROC curve is a plot of a test's true positive rate (sensitivity) against its false positive rate (1 - specificity). The terms "area under the curve" or "AUC" refer to the area under a receiver operating characteristic (ROC) curve, both of which are well known in the art. AUC measurements are useful for comparing the accuracy of classifiers across the complete data range. A classifier with a larger AUC has a higher ability to accurately classify unknowns between two groups of interest (e.g., ever tobacco users (former and current) vs. never tobacco users). ROC curves are useful for plotting the performance of a particular feature (e.g., any of the biomarkers described herein and/or any item of additional biomedical information) in distinguishing two populations. Typically, feature data across populations is sorted in ascending order based on the value of a single feature. The true positive rate and false positive rate of the data are then calculated for each value of the feature. The true positive rate is determined by counting the number of cases that exceed the value for that feature and then dividing by the total number of cases. The false positive rate is determined by counting the number of controls that exceed the value for that feature and then dividing by the total number of controls. While this definition refers to scenarios in which the feature is elevated compared to the control, it also applies to scenarios in which the feature is lower compared to the control (in such scenarios, samples below the value for the feature would be counted). ROC curves can be generated for single features and other single outputs; for example, combinations of two or more features can be mathematically combined (e.g., added, subtracted, multiplied, etc.) to provide a single total value, which can be plotted in the ROC curve. Additionally, any combination of multiple features whose combination derives a single output value can be plotted in the ROC curve.

バイオマーカーの例示的な使用
様々な例示的な実施形態において、多数の分析方法（本明細書において記載される分析方法のうちの任意のものを包含する）によって、個体の循環中に（血液、血清、または血漿等中に）存在する、１つまたは複数のバイオマーカーに対応する１つまたは複数のバイオマーカー値を検出することによる、個体がこれまでにタバコ使用者であった確率の予測のための方法が供される。これらのバイオマーカーは、例えば１度も喫煙しなかった個体に比較して、これまでに（以前及び現在の）タバコ使用者であった個体において差次的に発現される。個体におけるバイオマーカーの差次的発現の検出を使用して、例えばこれまでにタバコ使用者であった確率の予測を可能にすることができる。 Exemplary Uses of Biomarkers In various exemplary embodiments, methods are provided for predicting the probability that an individual is a former tobacco user by detecting one or more biomarker values corresponding to one or more biomarkers present in the individual's circulation (e.g., in blood, serum, or plasma) by a number of analytical methods (including any of the analytical methods described herein). These biomarkers are differentially expressed in individuals who are former (former and current) tobacco users compared to individuals who have never smoked, for example. Detection of differential expression of biomarkers in an individual can be used, for example, to enable prediction of the probability of being a former tobacco user.

独立型の診断検査としてのバイオマーカーレベルの検査に加えて、バイオマーカーレベルは、疾患または病態の感受性のリスクの増加を表すＳＮＰまたは他の遺伝的な病変もしくは変動の決定と併用しても行われ得る（例えばＡｍｏｓｅｔａｌ．，ＮａｔｕｒｅＧｅｎｅｔｉｃｓ４０，６１６－６２２（２００９）を参照）。In addition to examining biomarker levels as a stand-alone diagnostic test, biomarker levels can also be performed in conjunction with determining SNPs or other genetic lesions or variations that indicate an increased risk of susceptibility to a disease or condition (see, e.g., Amos et al., Nature Genetics 40, 616-622 (2009)).

記載されるバイオマーカーのうちの任意のものは、イメージング検査において使用され得る。例えば、造影剤は、記載されるバイオマーカーのうちの任意のものへカップリングされ得、これは、他の使用の中でも、個体がこれまでにタバコ使用者であった確率の予測を支援するために、治療介入への応答をモニタリングするために、臨床試験において標的集団を選択するために使用され得る。Any of the described biomarkers can be used in imaging studies. For example, an imaging agent can be coupled to any of the described biomarkers, which can be used to aid in predicting the probability that an individual is a former tobacco user, to monitor response to therapeutic interventions, to select target populations in clinical trials, among other uses.

バイオマーカー及びバイオマーカーレベルの検出及び決定
本明細書において記載されるバイオマーカーについてのバイオマーカーレベルは、様々な公知の分析方法のうちの任意のものを使用して検出され得る。一実施形態において、バイオマーカーレベルは、捕捉試薬を使用して検出される。本明細書において使用される時、「捕捉剤」または「捕捉試薬」は、バイオマーカーへ特異的に結合することが可能な分子を指す。様々な実施形態において、捕捉試薬は、溶液中でバイオマーカーへ曝露され得るか、または捕捉試薬が固体支持体に固定化された状態でバイオマーカーへ曝露され得る。他の実施形態において、捕捉試薬は、固体支持体上の二次的なフィーチャと反応性のフィーチャを含有する。これらの実施形態において、捕捉試薬は、溶液中のバイオマーカーへ曝露され得、次いで、捕捉試薬上のフィーチャを、固体支持体上の二次的なフィーチャと併用して使用して、固体支持体上にバイオマーカーを固定化することができる。捕捉試薬は、遂行される分析のタイプに基づいて選択される。捕捉試薬としては、ＳＯＭＡｍｅｒ、抗体、アドネクチン、アンキリン、他の抗体模倣体及び他のタンパク質スキャフォールド、自己抗体、キメラ、小分子、Ｆ（ａｂ’）_２断片、一本鎖抗体断片、Ｆｖ断片、一本鎖Ｆｖ断片、核酸、レクチン、リガンド結合受容体、アフィボディ、ナノボディ、インプリントポリマー、アビマー、ペプチド模倣体、ホルモン受容体、サイトカイン受容体及び合成受容体、ならびにこれらの修飾物及び断片が挙げられるがこれらに限定されない。 Detection and Determination of Biomarkers and Biomarker Levels Biomarker levels for the biomarkers described herein can be detected using any of a variety of known analytical methods. In one embodiment, biomarker levels are detected using a capture reagent. As used herein, "capture agent" or "capture reagent" refers to a molecule capable of specifically binding to a biomarker. In various embodiments, the capture reagent can be exposed to the biomarker in solution, or the capture reagent can be exposed to the biomarker while immobilized on a solid support. In other embodiments, the capture reagent contains features reactive with secondary features on the solid support. In these embodiments, the capture reagent can be exposed to the biomarker in solution, and then the features on the capture reagent can be used in conjunction with the secondary features on the solid support to immobilize the biomarker on the solid support. The capture reagent is selected based on the type of analysis to be performed. Capture reagents include, but are not limited to, SOMAmers, antibodies, adnectins, ankyrins, other antibody mimetics and other protein scaffolds, autoantibodies, chimeras, small molecules, F(ab')₂ fragments, single chain antibody fragments, Fv fragments, single chain Fv fragments, nucleic acids, lectins, ligand binding receptors, affibodies, nanobodies, imprinted polymers, avimers, peptidomimetics, hormone receptors, cytokine receptors and synthetic receptors, and modifications and fragments thereof.

いくつかの実施形態において、バイオマーカーレベルは、バイオマーカー／捕捉試薬複合体を使用して検出される。In some embodiments, biomarker levels are detected using a biomarker/capture reagent complex.

他の実施形態において、バイオマーカーレベルは、バイオマーカー／捕捉試薬複合体に由来し、例えばバイオマーカー／捕捉試薬相互作用に後続する反応の結果として等で、間接的に検出されるが、バイオマーカー／捕捉試薬複合体の形成に依存する。In other embodiments, the biomarker level is derived from the biomarker/capture reagent complex and is detected indirectly, e.g., as a result of a reaction subsequent to the biomarker/capture reagent interaction, but dependent on the formation of the biomarker/capture reagent complex.

いくつかの実施形態において、バイオマーカーレベルは、生物学的サンプル中のバイオマーカーから直接的に検出される。In some embodiments, the biomarker level is detected directly from the biomarker in the biological sample.

一実施形態において、バイオマーカーは、生物学的サンプル中の２つ以上のバイオマーカーの同時検出を可能にするマルチプレックスフォーマットを使用して検出される。マルチプレックスフォーマットの一実施形態において、捕捉試薬は、固体支持体上の不連続の所在位置に、共有結合または非共有結合により直接的または間接的に固定化される。別の実施形態において、マルチプレックスフォーマットは、不連続の固体支持体を使用し、各々の固体支持体は、その固体支持体と会合する一意的な捕捉試薬（例えば量子ドット等）を有する。別の実施形態において、個別のデバイスは、生物学的サンプル中の検出されるべき複数のバイオマーカーのうちの各々の検出のために使用される。個別のデバイスは、生物学的サンプル中の各々のバイオマーカーが同時にプロセシングされることを許容するように構成され得る。例えば、プレート中の各々ウェルが、生物学的サンプル中の検出されるべき複数のバイオマーカーのうちの１つを一意的に分析するために使用されるように、マイクロタイタープレートは使用され得る。In one embodiment, biomarkers are detected using a multiplex format, which allows for the simultaneous detection of two or more biomarkers in a biological sample. In one embodiment of a multiplex format, capture reagents are immobilized, directly or indirectly, by covalent or non-covalent attachment to discrete locations on a solid support. In another embodiment, the multiplex format uses discrete solid supports, each with a unique capture reagent (e.g., quantum dots) associated with it. In another embodiment, a separate device is used for the detection of each of the multiple biomarkers to be detected in the biological sample. The separate devices can be configured to allow each biomarker in the biological sample to be processed simultaneously. For example, a microtiter plate can be used such that each well in the plate is used to uniquely analyze one of the multiple biomarkers to be detected in the biological sample.

前述の実施形態のうちの１つまたは複数において、バイオマーカー値の検出を可能にするために、蛍光タグを使用して、バイオマーカー／捕捉物複合体の構成要素が標識され得る。様々な実施形態において、蛍光標識は、本明細書において記載されるバイオマーカーのうちの任意のものへ特異的な捕捉試薬へ、公知の技法を使用してコンジュゲートされ得、次いで蛍光標識は、対応するバイオマーカー値を検出するために使用され得る。好適な蛍光標識としては、希土類キレート、フルオレセイン及びその誘導体、ローダミン及びその誘導体、ダンシル、アロフィコシアニン、ＰＢＸＬ－３、Ｑｄｏｔ６０５、Ｌｉｓｓａｍｉｎｅ、フィコエリトリン、ＴｅｘａｓＲｅｄ、ならびに他のかかる化合物が挙げられる。In one or more of the foregoing embodiments, a component of the biomarker/capture complex may be labeled using a fluorescent tag to enable detection of the biomarker value. In various embodiments, a fluorescent label may be conjugated to a capture reagent specific for any of the biomarkers described herein using known techniques, and the fluorescent label may then be used to detect the corresponding biomarker value. Suitable fluorescent labels include rare earth chelates, fluorescein and its derivatives, rhodamine and its derivatives, dansyl, allophycocyanin, PBXL-3, Qdot 605, Lissamine, phycoerythrin, Texas Red, and other such compounds.

一実施形態において、蛍光標識は、蛍光色素分子である。いくつかの実施形態において、蛍光色素分子は、インドリウム環の３－炭素上の置換基が化学反応性基またはコンジュゲートされた物質を含有する、少なくとも１つの置換されたインドリウム環系を含む。いくつかの実施形態において、色素分子としては、例えばＡｌｅｘａＦｌｕｏｒ４８８、ＡｌｅｘａＦｌｕｏｒ５３２、ＡｌｅｘａＦｌｕｏｒ６４７、ＡｌｅｘａＦｌｕｏｒ６８０、またはＡｌｅｘａＦｌｕｏｒ７００等のＡｌｅｘＦｌｕｏｒ分子が挙げられる。他の実施形態において、色素分子は、第１のタイプ及び第２のタイプの色素分子（例えば２つの異なるＡｌｅｘａＦｌｕｏｒ分子等）を包含する。他の実施形態において、色素分子は、第１のタイプ及び第２のタイプの色素分子を包含し、２つの色素分子は異なる発光スペクトルを有する。In one embodiment, the fluorescent label is a fluorescent dye molecule. In some embodiments, the fluorescent dye molecule comprises at least one substituted indolium ring system in which a substituent on the 3-carbon of the indolium ring contains a chemically reactive group or a conjugated substance. In some embodiments, the dye molecule includes an AlexFluor molecule, such as AlexaFluor 488, AlexaFluor 532, AlexaFluor 647, AlexaFluor 680, or AlexaFluor 700. In other embodiments, the dye molecule includes a first type and a second type of dye molecule (e.g., two different AlexaFluor molecules). In other embodiments, the dye molecule includes a first type and a second type of dye molecule, where the two dye molecules have different emission spectra.

蛍光は、広範囲のアッセイフォーマットと適合性のある様々な装置類により測定され得る。例えば、分光蛍光計は、マイクロタイタープレート、顕微鏡用スライド、プリントアレイ、キュベットなどを分析するようにデザインされている。Ｊ．Ｒ．Ｌａｋｏｗｉｃｚ，ＳｐｒｉｎｇｅｒＳｃｉｅｎｃｅ＋ＢｕｓｉｎｅｓｓＭｅｄｉａ，Ｉｎｃ．，２００４による、ＰｒｉｎｃｉｐｌｅｓｏｆＦｌｕｏｒｅｓｃｅｎｃｅＳｐｅｃｔｒｏｓｃｏｐｙを参照されたい。Ｂｉｏｌｕｍｉｎｅｓｃｅｎｃｅ＆Ｃｈｅｍｉｌｕｍｉｎｅｓｃｅｎｃｅ：Ｐｒｏｇｒｅｓｓ＆ＣｕｒｒｅｎｔＡｐｐｌｉｃａｔｉｏｎｓ；ＰｈｉｌｉｐＥ．ＳｔａｎｌｅｙａｎｄＬａｒｒｙＪ．Ｋｒｉｃｋａｅｄｉｔｏｒｓ，ＷｏｒｌｄＳｃｉｅｎｔｉｆｉｃＰｕｂｌｉｓｈｉｎｇＣｏｍｐａｎｙ，Ｊａｎｕａｒｙ２００２を参照されたい。Fluorescence can be measured by a variety of instruments compatible with a wide range of assay formats. For example, spectrofluorometers are designed to analyze microtiter plates, microscope slides, printed arrays, cuvettes, etc. See "Principles of Fluorescence Spectroscopy" by J. R. Lakowicz, Springer Science + Business Media, Inc., 2004; "Bioluminescence & Chemiluminescence: Progress & Current Applications" by Philip E. Stanley and Larry J. See Kricka editors, World Scientific Publishing Company, January 2002.

前述の実施形態のうちの１つまたは複数において、バイオマーカー値の検出を可能にするために、化学発光タグを任意選択で使用して、バイオマーカー／捕捉物複合体の構成要素が標識され得る。好適な化学発光材料としては、塩化オキサリル、ローダミン（Ｒｏｄａｍｉｎ）６Ｇ、Ｒｕ（ｂｉｐｙ）３２＋、ＴＭＡＥ（テトラキス（ジメチルアミノ）エチレン）、ピロガロール（１，２，３－トリヒドロキシベンゼン（ｔｒｉｈｙｄｒｏｘｉｂｅｎｚｅｎｅ））、ルシゲニン、ペルオキシオキサレート、アリールオキサレート、アクリジニウムエステル、ジオキセタン、及び他のもののうちの任意のものが挙げられる。In one or more of the foregoing embodiments, a chemiluminescent tag can optionally be used to label a component of the biomarker/capture complex to enable detection of the biomarker value. Suitable chemiluminescent materials include any of oxalyl chloride, rhodamine 6G, Ru(bipy)32+, TMAE (tetrakis(dimethylamino)ethylene), pyrogallol (1,2,3-trihydroxybenzene), lucigenin, peroxyoxalates, aryloxalates, acridinium esters, dioxetanes, and others.

さらに他の実施形態において、検出方法は、バイオマーカー値に対応する検出可能なシグナルを生成する酵素／基質の組み合わせを包含する。概して、酵素は、様々な技法（分光測光、蛍光、及び化学発光が挙げられる）を使用して測定され得る色素形成基質の化学的変更を触媒する。好適な酵素としては、例えばルシフェラーゼ、ルシフェリン、リンゴ酸脱水素酵素、ウレアーゼ、ホースラディッシュペルオキシダーゼ（ＨＲＰＯ）、アルカリホスファターゼ、β－ガラクトシダーゼ、グルコアミラーゼ、リゾチーム、グルコースオキシダーゼ、ガラクトース酸化酵素及びグルコース－６－リン酸脱水素酵素、ウリカーゼ、キサンチンオキシダーゼ、ラクトペルオキシダーゼ、ミクロペルオキシダーゼ、ならびに同種のものが挙げられる。In yet other embodiments, the detection method involves an enzyme/substrate combination that generates a detectable signal corresponding to the biomarker value. Generally, the enzyme catalyzes a chemical alteration of a chromogenic substrate that can be measured using a variety of techniques, including spectrophotometry, fluorescence, and chemiluminescence. Suitable enzymes include, for example, luciferase, luciferin, malate dehydrogenase, urease, horseradish peroxidase (HRPO), alkaline phosphatase, β-galactosidase, glucoamylase, lysozyme, glucose oxidase, galactose oxidase and glucose-6-phosphate dehydrogenase, uricase, xanthine oxidase, lactoperoxidase, microperoxidase, and the like.

さらに他の実施形態において、検出方法は、蛍光、化学発光、または放射性核種、または測定可能なシグナルを生成する酵素／基質組み合わせの組み合わせであり得る。多モードのシグナル生成は、バイオマーカーアッセイフォーマットにおいて固有且つ有利な特徴を有し得る。In yet other embodiments, the detection method may be a combination of fluorescent, chemiluminescent, or radionuclide, or enzyme/substrate combinations that generate a measurable signal. Multimodal signal generation may be a unique and advantageous feature in biomarker assay formats.

より具体的には、本明細書において記載されるバイオマーカーについてのバイオマーカーレベルは、公知の分析方法を使用して検出され得、当該分析方法としては、以下で詳述されるような、シングルプレックスＳＯＭＡｍｅｒアッセイ、マルチプレックスＳＯＭＡｍｅｒアッセイ、シングルプレックスまたはマルチプレックス免疫アッセイ、ｍＲＮＡ発現プロファイリング、ｍｉＲＮＡ発現プロファイリング、質量分光分析、組織学的／細胞学的な方法などが挙げられる。More specifically, biomarker levels for the biomarkers described herein can be detected using known analytical methods, including singleplex SOMAmer assays, multiplex SOMAmer assays, singleplex or multiplex immunoassays, mRNA expression profiling, miRNA expression profiling, mass spectrometry, histological/cytological methods, and the like, as described in detail below.

アプタマーベースのアッセイを使用するバイオマーカーレベルの決定
生物学的サンプル及び他のサンプル中の生理学的に有意な分子の検出及び定量化へ向けたアッセイは、科学研究及びヘルスケア分野における重要なツールである。かかるアッセイの１つのクラスは、固体支持体に固定化された１つまたは複数のアプタマーを含むマイクロアレイの使用を伴う。アプタマーは各々、高度に特異的な様式で且つ非常に高親和性で、標的分子へ結合することが可能である。例えば表題「ＮｕｃｌｅｉｃＡｃｉｄＬｉｇａｎｄｓ」の米国特許第５，４７５，０９６号を参照されたい。例えば米国特許第６，２４２，２４６号、米国特許第６，４５８，５４３号、及び米国特許第６，５０３，７１５号（その各々は、表題「ＮｕｃｌｅｉｃＡｃｉｄＬｉｇａｎｄＤｉａｇｎｏｓｔｉｃＢｉｏｃｈｉｐ」である）も参照されたい。一旦マイクロアレイがサンプルと接触させられたならば、アプタマーは、サンプル中に存在するそれぞれの標的分子へ結合し、それによってバイオマーカーに対応するバイオマーカー値の決定を可能にする。 Determining Biomarker Levels Using Aptamer-Based Assays Assays directed at the detection and quantification of physiologically significant molecules in biological and other samples are important tools in scientific research and healthcare. One class of such assays involves the use of microarrays containing one or more aptamers immobilized on a solid support. Each aptamer is capable of binding to a target molecule in a highly specific manner and with very high affinity. See, e.g., U.S. Pat. No. 5,475,096, entitled "Nucleic Acid Ligands." See also, e.g., U.S. Pat. Nos. 6,242,246, 6,458,543, and 6,503,715, each entitled "Nucleic Acid Ligand Diagnostic Biochip." Once the microarray is contacted with a sample, the aptamers bind to their respective target molecules present in the sample, thereby allowing the determination of a biomarker value corresponding to the biomarker.

本明細書において使用される時、「アプタマー」は、標的分子への特異的結合親和性を有する核酸を指す。親和性相互作用は、程度の問題であることが認識されるが、この文脈において、アプタマーのその標的についての「特異的結合親和性」は、概して、当該アプタマーが、検査サンプル中の他の構成要素へ結合するよりもはるかに高い程度の親和性でその標的へ結合することを意味する。「アプタマー」は、特定のヌクレオチド配列を有する１つのタイプまたは種の核酸分子のコピーのセットである。アプタマーは、任意の好適な数のヌクレオチド（任意の数の化学修飾ヌクレオチドを包含する）を含み得る。「アプタマー（複数）」は、１を超えるえるかかる分子のセットを指す。異なるアプタマーは、同じ数または異なる数のいずれかのヌクレオチドを有し得る。アプタマーは、ＤＮＡもしくはＲＮＡまたは化学修飾核酸であり得、一本鎖であり得るか、二本鎖であり得るか、または二本鎖領域を含有し得、より高次の構造を含み得る。アプタマーは、フォトアプタマーでもあり得、そこで、アプタマーがその対応する標的へ共有結合で連結されることを可能にするように、光反応性または化学反応性の官能基がアプタマー中に含まれる。本明細書において開示されるアプタマー方法のうちの任意のものは、特異的に同じ標的分子を結合する２つ以上のアプタマーの使用を包含し得る。さらに下で記載されるように、アプタマーは、タグを含み得る。アプタマーがタグを含むならば、アプタマーのすべてのコピーは同じタグを有する必要はない。さらに、異なるアプタマー各々がタグを含むならば、これらの異なるアプタマーは、同じタグまたは異なるタグのいずれかを有し得る。As used herein, "aptamer" refers to a nucleic acid that has specific binding affinity for a target molecule. While it is recognized that affinity interactions are a matter of degree, in this context, "specific binding affinity" of an aptamer for its target generally means that the aptamer binds to its target with a degree of affinity that is significantly greater than the aptamer's binding to other components in a test sample. An "aptamer" is a set of copies of one type or species of nucleic acid molecule having a specific nucleotide sequence. Aptamers can contain any suitable number of nucleotides, including any number of chemically modified nucleotides. "Aptamers" refers to a set of more than one such molecule. Different aptamers can have either the same or different numbers of nucleotides. Aptamers can be DNA or RNA or chemically modified nucleic acids and can be single-stranded, double-stranded, or contain double-stranded regions and higher-order structures. Aptamers can also be photoaptamers, in which a photoreactive or chemically reactive functional group is included in the aptamer to allow the aptamer to be covalently linked to its corresponding target. Any of the aptamer methods disclosed herein can involve the use of two or more aptamers that specifically bind the same target molecule. As described further below, aptamers can include a tag. If an aptamer includes a tag, not all copies of the aptamer need have the same tag. Furthermore, if different aptamers each include a tag, these different aptamers can have either the same tag or different tags.

アプタマーは、任意の公知の方法を使用して同定され得、当該方法としては、ＳＥＬＥＸプロセスが挙げられる。一旦同定されたならば、アプタマーは、任意の公知の方法に従って調製または合成され得、当該方法としては、化学的合成方法及び酵素による合成方法が挙げられる。Aptamers can be identified using any known method, including the SELEX process. Once identified, aptamers can be prepared or synthesized according to any known method, including chemical and enzymatic synthesis.

本明細書において使用される時、「ＳＯＭＡｍｅｒ」または解離速度の遅い修飾されたアプタマー（ＳｌｏｗＯｆｆ－ＲａｔｅＭｏｄｉｆｉｅｄＡｐｔａｍｅｒ）は、改善された解離速度特徴を有するアプタマーを指す。ＳＯＭＡｍｅｒは、表題「ＭｅｔｈｏｄｆｏｒＧｅｎｅｒａｔｉｎｇＡｐｔａｍｅｒｓｗｉｔｈＩｍｐｒｏｖｅｄＯｆｆ－Ｒａｔｅｓ．」の米国特許出願公開第２００９／０００４６６７号中で記載される、改善されたＳＥＬＥＸ方法を使用して生成すされ得る。As used herein, "SOMAmer" or slow off-rate modified aptamer refers to an aptamer with improved off-rate characteristics. SOMAmers can be generated using the improved SELEX method described in U.S. Patent Application Publication No. 2009/0004667, entitled "Method for Generating Aptamers with Improved Off-Rates."

「ＳＥＬＥＸ」及び「ＳＥＬＥＸプロセス」という用語は、概して、（１）所望される様式で標的分子と相互作用する（例えば高親和性でタンパク質へ結合する）アプタマーの選択と、（２）これらの選択された核酸の増幅との組み合わせを指すように本明細書において互換的に使用される。ＳＥＬＥＸプロセスは、特異的な標的またはバイオマーカーへの高親和性を備えたアプタマーを同定するために使用され得る。The terms "SELEX" and "SELEX process" are generally used interchangeably herein to refer to the combination of (1) the selection of aptamers that interact with a target molecule in a desired manner (e.g., bind to a protein with high affinity) and (2) the amplification of these selected nucleic acids. The SELEX process can be used to identify aptamers with high affinity for a specific target or biomarker.

ＳＥＬＥＸは、概して、核酸の候補混合物を調製すること、候補混合物を所望される標的分子へ結合させて親和性複合体を形成すること、親和性複合体を未結合の候補核酸から分離すること、核酸を親和性複合体から分離及び単離すること、核酸を精製すること、ならびに特異的アプタマー配列を同定することを包含する。プロセスは、選択されたアプタマーの親和性をさらに微調整するために、複数のラウンドを含み得る。プロセスは、プロセス中の１つまたは複数のポイントで増幅ステップを含み得る。例えば表題「ＮｕｃｌｅｉｃＡｃｉｄＬｉｇａｎｄｓ」の米国特許第５，４７５，０９６号を参照されたい。ＳＥＬＥＸプロセスは、標的に共有結合するアプタマー、そして標的に非共有結合するアプタマーを生成するために使用され得る。例えば表題「ＳｙｓｔｅｍａｔｉｃＥｖｏｌｕｔｉｏｎｏｆＮｕｃｌｅｉｃＡｃｉｄＬｉｇａｎｄｓｂｙＥｘｐｏｎｅｎｔｉａｌＥｎｒｉｃｈｍｅｎｔ：Ｃｈｅｍｉ－ＳＥＬＥＸ．」の米国特許第５，７０５，３３７号を参照されたい。SELEX generally involves preparing a candidate mixture of nucleic acids, binding the candidate mixture to a desired target molecule to form an affinity complex, separating the affinity complex from unbound candidate nucleic acids, separating and isolating the nucleic acids from the affinity complex, purifying the nucleic acids, and identifying a specific aptamer sequence. The process may include multiple rounds to further fine-tune the affinity of the selected aptamer. The process may include an amplification step at one or more points in the process. See, for example, U.S. Patent No. 5,475,096 entitled "Nucleic Acid Ligands." The SELEX process can be used to generate aptamers that bind covalently to a target, as well as aptamers that bind non-covalently to a target. See, for example, U.S. Patent No. 5,705,337, entitled "Systematic Evolution of Nucleic Acid Ligands by Exponential Enrichment: Chemi-SELEX."

ＳＥＬＥＸプロセスは、アプタマーについての改善された特徴（例えば改善されたインビボの安定性または改善された送達特徴等）を付与する修飾ヌクレオチドを含有する高親和性アプタマーを同定するために使用され得る。かかる修飾の例としては、リボース及び／またはリン酸及び／または塩基の位置での化学的置換が挙げられる。修飾ヌクレオチドを含有するＳＥＬＥＸプロセス同定アプタマーは、表題「ＨｉｇｈＡｆｆｉｎｉｔｙＮｕｃｌｅｉｃＡｃｉｄＬｉｇａｎｄｓＣｏｎｔａｉｎｉｎｇＭｏｄｉｆｉｅｄＮｕｃｌｅｏｔｉｄｅｓ」の米国特許第５，６６０，９８５号中で記載され、同文献は、ピリミジンの５’位及び２’位で化学的に修飾されたヌクレオチド誘導体を含有するオリゴヌクレオチドを記載する。米国特許第５，５８０，７３７号（上記を参照）は、２’－アミノ（２’－ＮＨ２）、２’－フルオロ（２’－Ｆ）、及び／または２’－Ｏメチル（２’－ＯＭｅ）により修飾された１つまたは複数のヌクレオチドを含有する高度に特異的アプタマーを記載する。物理的及び化学的な特性が増幅された核酸ライブラリーならびにＳＥＬＥＸ及びフォトＳＥＬＥＸにおけるそれらの使用を記載する、表題「ＳＥＬＥＸａｎｄＰＨＯＴＯＳＥＬＥＸ」の米国特許出願公開第２００９００９８５４９号も参照されたい。The SELEX process can be used to identify high-affinity aptamers containing modified nucleotides that confer improved characteristics on the aptamer, such as improved in vivo stability or improved delivery characteristics. Examples of such modifications include chemical substitutions at the ribose and/or phosphate and/or base positions. SELEX process-identified aptamers containing modified nucleotides are described in U.S. Pat. No. 5,660,985, entitled "High Affinity Nucleic Acid Ligands Containing Modified Nucleotides," which describes oligonucleotides containing nucleotide derivatives chemically modified at the 5' and 2' positions of the pyrimidines. U.S. Patent No. 5,580,737 (see above) describes highly specific aptamers containing one or more nucleotides modified with 2'-amino (2'-NH2), 2'-fluoro (2'-F), and/or 2'-Omethyl (2'-OMe). See also U.S. Patent Application Publication No. 20090098549, entitled "SELEX and PHOTOSELEX," which describes physical and chemical property amplified nucleic acid libraries and their use in SELEX and photoSELEX.

ＳＥＬＥＸは、所望される解離速度特徴を有するアプタマーを同定するために使用され得る。標的分子へ結合し得るアプタマーを生成する改善されたＳＥＬＥＸ方法を記載する、表題「ＭｅｔｈｏｄｆｏｒＧｅｎｅｒａｔｉｎｇＡｐｔａｍｅｒｓｗｉｔｈＩｍｐｒｏｖｅｄＯｆｆ－Ｒａｔｅｓ」の米国特許出願公開第２００９０００４６６７号を参照されたい。上で言及されるように、これらの解離速度の遅いアプタマーは「ＳＯＭＡｍｅｒ」として公知である。それぞれの標的分子からのより遅い解離速度を有するアプタマーまたはＳＯＭＡｍｅｒ及びフォトアプタマーまたはフォトＳＯＭＡｍｅｒを産生する方法が記載される。方法は、候補混合物を標的分子と接触させること、核酸ターゲット複合体の形成を可能にすること、及び速い解離速度の核酸－ターゲット複合体は解離し再形成されないが、遅い解離速度の複合体はインタクトなでままであるように解離速度の遅いものをエンリッチするプロセスを遂行することを含む。追加で、方法は、改善された解離速度性能を備えたアプタマーまたはＳＯＭＡｍｅｒを生成するために、候補核酸混合物の産生において修飾ヌクレオチドの使用を含む。非限定的な例示的な修飾ヌクレオチドとしては、例えば図３及び４中で示される修飾ピリミジンが挙げられる。SELEX can be used to identify aptamers with desired off-rate characteristics. See U.S. Patent Application Publication No. 20090004667, entitled "Method for Generating Aptamers with Improved Off-Rates," which describes an improved SELEX method for generating aptamers capable of binding to target molecules. As mentioned above, these slow-off-rate aptamers are known as "SOMAmers." Methods for producing aptamers or SOMAmers and photoaptamers or photoSOMAmers with slower off-rates from their respective target molecules are described. The method includes contacting the candidate mixture with a target molecule, allowing the formation of nucleic acid-target complexes, and performing a process that enriches for slower dissociation rates such that fast dissociation rate nucleic acid-target complexes dissociate and do not reform, while slower dissociation rate complexes remain intact. Additionally, the method includes the use of modified nucleotides in the production of the candidate nucleic acid mixture to generate aptamers or SOMAmers with improved dissociation rate performance. Non-limiting exemplary modified nucleotides include, for example, modified pyrimidines shown in Figures 3 and 4.

このアッセイのバリエーションは、アプタマーがその標的分子に共有結合するかまたは「光架橋する」ことを可能にする光反応性官能基を含むアプタマーを用いるものである。例えば表題「ＮｕｃｌｅｉｃＡｃｉｄＬｉｇａｎｄＤｉａｇｎｏｓｔｉｃＢｉｏｃｈｉｐ」の米国特許第６，５４４，７７６号を参照されたい。これらの光反応性アプタマーは、フォトアプタマーとも称される。例えば米国特許第５，７６３，１７７号、米国特許第６，００１，５７７号、及び米国特許第６，２９１，１８４号（その各々は、表題「ＳｙｓｔｅｍａｔｉｃＥｖｏｌｕｔｉｏｎｏｆＮｕｃｌｅｉｃＡｃｉｄＬｉｇａｎｄｓｂｙＥｘｐｏｎｅｎｔｉａｌＥｎｒｉｃｈｍｅｎｔ：ＰｈｏｔｏｓｅｌｅｃｔｉｏｎｏｆＮｕｃｌｅｉｃＡｃｉｄＬｉｇａｎｄｓａｎｄＳｏｌｕｔｉｏｎＳＥＬＥＸ」である）を参照されたい。例えば表題「ＰｈｏｔｏｓｅｌｅｃｔｉｏｎｏｆＮｕｃｌｅｉｃＡｃｉｄＬｉｇａｎｄｓ」の米国特許第６，４５８，５３９号も参照されたい。マイクロアレイがサンプルと接触し、フォトアプタマーがその標的分子へ結合する機会があった後に、フォトアプタマーは光活性化され、固体支持体は洗浄されて、任意の非特異的に結合された分子を除去する。概して、フォトアプタマー上の光活性化官能基（複数可）によって生成された共有結合に起因して、フォトアプタマーへ結合された標的分子は除去されないので、厳しい洗浄条件が使用され得る。この様式において、アッセイは、検査サンプル中のバイオマーカーに対応するバイオマーカー値の検出を可能にする。A variation of this assay uses aptamers containing photoreactive functional groups that allow the aptamer to covalently bind or "photocrosslink" to its target molecule. See, for example, U.S. Patent No. 6,544,776, entitled "Nucleic Acid Ligand Diagnostic Biochip." These photoreactive aptamers are also referred to as photoaptamers. See, e.g., U.S. Patent Nos. 5,763,177, 6,001,577, and 6,291,184, each of which is entitled "Systematic Evolution of Nucleic Acid Ligands by Exponential Enrichment: Photoselection of Nucleic Acid Ligands and Solution SELEX." See also, e.g., U.S. Patent No. 6,458,539, entitled "Photoselection of Nucleic Acid Ligands." After the microarray is contacted with the sample and the photoaptamers have had an opportunity to bind to their target molecules, the photoaptamers are photoactivated and the solid support is washed to remove any non-specifically bound molecules. Generally, stringent washing conditions can be used because the target molecules bound to the photoaptamers are not removed due to the covalent bond created by the photoactivated functional group(s) on the photoaptamers. In this manner, the assay allows for the detection of biomarker values corresponding to the biomarkers in the test sample.

これらのアッセイフォーマットの両方において、アプタマーまたはＳＯＭＡｍｅｒは、サンプルとの接触の前に固体支持体上で固定化される。しかしながら、ある特定の状況の下では、サンプルとの接触の前のアプタマーまたはＳＯＭＡｍｅｒの固定化は、最適なアッセイを提供しないかもしれない。例えば、アプタマーまたはＳＯＭＡｍｅｒの前固定化は、固体支持体の表面上でのアプタマーまたはＳＯＭＡｍｅｒと標的分子との非能率的な混合をもたらし、恐らく長い反応時間を導き、したがって、アプタマーまたはＳＯＭＡｍｅｒがそれらの標的分子へ効率的に結合することを可能にするために、インキュベーション期間が延長され得る。さらに、フォトアプタマーまたはフォトＳＯＭＡｍｅｒがアッセイにおいて用いられる場合に、及び固体支持体として利用された材料に依存して、固体支持体は、フォトアプタマーまたはフォトＳＯＭＡｍｅｒとそれらの標的分子との間での共有結合の形成を成し遂げるために使用される光を散乱または吸収する傾向があり得る。さらに、用いられる方法に依存して、アプタマーまたはフォトＳＯＭＡｍｅｒへ結合された標的分子の検出は不正確になりやすいが、それは、固体支持体の表面も使用される任意の標識剤へ曝露され、それによって影響を受け得るからである。最終的に、固体支持体上のアプタマーまたはＳＯＭＡｍｅｒの固定化は、概して、アプタマーまたはＳＯＭＡｍｅｒをサンプルへ曝露する前のアプタマーまたはＳＯＭＡｍｅｒの調製ステップ（すなわち固定化）を含み、この調製ステップは、アプタマーまたはＳＯＭＡｍｅｒの活性または官能性に影響し得る。In both of these assay formats, the aptamer or SOMAmer is immobilized on a solid support prior to contact with the sample. However, under certain circumstances, immobilizing the aptamer or SOMAmer prior to contact with the sample may not provide an optimal assay. For example, pre-immobilization of the aptamer or SOMAmer may result in inefficient mixing of the aptamer or SOMAmer with the target molecule on the surface of the solid support, possibly leading to long reaction times; therefore, incubation periods may need to be extended to allow the aptamer or SOMAmer to efficiently bind to their target molecules. Furthermore, when photoaptamers or photoSOMAmers are used in the assay, and depending on the material utilized as the solid support, the solid support may tend to scatter or absorb light used to achieve covalent bond formation between the photoaptamer or photoSOMAmer and their target molecule. Furthermore, depending on the method used, detection of target molecules bound to aptamers or photoSOMAmers can be prone to inaccuracies because the surface of the solid support may also be exposed to and affected by any labeling agent used. Finally, immobilization of aptamers or SOMAmers on a solid support generally involves a preparation step (i.e., immobilization) of the aptamer or SOMAmer prior to exposing the aptamer or SOMAmer to a sample, which preparation step may affect the activity or functionality of the aptamer or SOMAmer.

ＳＯＭＡｍｅｒが溶液中でその標的を捕捉することを可能にし、次いで、検出の前にＳＯＭＡｍｅｒ－標的混合物の特定の構成要素を除去するようにデザインされている分離ステップを用いる、ＳＯＭＡｍｅｒアッセイも記載されている（表題「ＭｕｌｔｉｐｌｅｘｅｄＡｎａｌｙｓｅｓｏｆＴｅｓｔＳａｍｐｌｅｓ」の米国特許出願公開第２００９００４２２０６号を参照）。記載されるＳＯＭＡｍｅｒアッセイ方法は、核酸（すなわちＳＯＭＡｍｅｒ）の検出及び定量によって、検査サンプル中の非核酸標的（例えばタンパク質標的）の検出及び定量化を可能にする。記載される方法は、非核酸標的の検出及び定量のための核酸代用物（すなわちＳＯＭＡｍｅｒ）を生成し、これにより、様々な核酸テクノロジー（増幅が挙げられる）が、広範囲の所望される標的（タンパク質標的が挙げられる）へ適用されることを可能にする。SOMAmer assays have also been described that allow SOMAmers to capture their targets in solution, followed by a separation step designed to remove specific components of the SOMAmer-target mixture prior to detection (see U.S. Patent Application Publication No. 20090042206, entitled "Multiplexed Analyses of Test Samples"). The described SOMAmer assay methods enable the detection and quantification of non-nucleic acid targets (e.g., protein targets) in test samples through the detection and quantification of nucleic acids (i.e., SOMAmers). The described methods generate nucleic acid surrogates (i.e., SOMAmers) for the detection and quantification of non-nucleic acid targets, thereby enabling various nucleic acid technologies (including amplification) to be applied to a wide range of desired targets (including protein targets).

ＳＯＭＡｍｅｒは、ＳＯＭＡｍｅｒバイオマーカー複合体（またはフォトＳＯＭＡｍｅｒバイオマーカー共有結合複合体）からのアッセイ構成要素の分離を助長し、検出及び／または定量化のためにＳＯＭＡｍｅｒの単離を可能にするように構築され得る。いくつかの実施形態において、これらのコンストラクトは、ＳＯＭＡｍｅｒ配列内の切断可能または放出可能なエレメントを含み得る。他の実施形態において、追加の機能性は、ＳＯＭＡｍｅｒの中へ導入され得、例えば標識もしくは検出可能な構成要素、スペーサー構成要素、または特異的結合タグもしくは固定化エレメントである。例えば、ＳＯＭＡｍｅｒは、切断可能部分、標識、標識を分離するスペーサー構成要素、及び切断可能部分を介してＳＯＭＡｍｅｒへ接続されるタグを含み得る。一実施形態において、切断可能要素は、光切断可能リンカーである。光切断可能リンカーは、ビオチン部分及びスペーサーセクションへ結合され得、アミンの誘導体化のためのＮＨＳ基を含み得、ビオチン基をアプタマーへ導入するために使用され、それによって、アッセイ方法における後のアプタマーの放出を可能にし得る。SOMAmers can be constructed to facilitate separation of assay components from the SOMAmer-biomarker complex (or photo-SOMAmer-biomarker covalent complex), allowing for isolation of the SOMAmer for detection and/or quantification. In some embodiments, these constructs can include cleavable or releasable elements within the SOMAmer sequence. In other embodiments, additional functionality can be introduced into the SOMAmer, such as a label or detectable component, a spacer component, or a specific binding tag or immobilization element. For example, a SOMAmer can include a cleavable moiety, a label, a spacer component that separates the label, and a tag connected to the SOMAmer via the cleavable moiety. In one embodiment, the cleavable element is a photocleavable linker. The photocleavable linker can be attached to a biotin moiety and a spacer section and can include an NHS group for amine derivatization, which can be used to introduce a biotin group into the aptamer, thereby allowing for release of the aptamer later in the assay method.

溶液中ですべてのアッセイ構成要素により行った均一アッセイは、シグナルの検出の前のサンプル及び試薬の分離を要求しない。これらの方法は迅速であり、使用が容易である。これらの方法は、分子捕捉または特異的標的と反応する結合試薬に基づいて、シグナルを生成する。タバコ使用状態の予測のために、分子捕捉試薬は、アプタマーもしくは抗体または同種のものであり、特異的標的は表４中のようなタバコ使用バイオマーカーだろう。Homogeneous assays, performed with all assay components in solution, do not require separation of sample and reagents prior to signal detection. These methods are rapid and easy to use. These methods generate signals based on molecular capture or binding reagents that react with specific targets. For prediction of tobacco use status, the molecular capture reagent would be an aptamer or antibody or the like, and the specific target would be a tobacco use biomarker such as those in Table 4.

いくつかの実施形態において、シグナル生成のための方法は、フルオロフォア標識捕捉試薬とその特異的バイオマーカー標的との相互作用に起因する異方性シグナル変化を利用する。標識された捕捉試薬がその標的と反応する場合に、分子量の増加は、複合体へ結合されたフルオロフォアの回転運動を非常に遅くし、異方性値を変化させる。異方性変化のモニタリングによって、結合事象を使用して、溶液中のバイオマーカーが定量的に測定され得る。他の方法としては、蛍光偏光アッセイ、分子ビーコン方法、時間分解蛍光クエンチング、化学発光、蛍光共鳴エネルギー転移、及び同種のものが挙げられる。In some embodiments, methods for signal generation utilize the anisotropic signal change resulting from the interaction of a fluorophore-labeled capture reagent with its specific biomarker target. When the labeled capture reagent reacts with its target, the increase in molecular weight significantly slows the rotational motion of the fluorophore bound to the complex, changing the anisotropy value. By monitoring the anisotropy change, the binding event can be used to quantitatively measure the biomarker in solution. Other methods include fluorescence polarization assays, molecular beacon methods, time-resolved fluorescence quenching, chemiluminescence, fluorescence resonance energy transfer, and the like.

生物学的サンプル中のバイオマーカーに対応するバイオマーカー値を検出するために使用され得る例示的な溶液ベースのアプタマーアッセイは、以下の：（ａ）生物学的サンプルを、第１のタグを含み、バイオマーカーへの特異的親和性を有するアプタマーと接触させることによって、混合物を調製し、バイオマーカーが、サンプル中に存在する場合に、アプタマー親和性複合体が形成されること；（ｂ）混合物を第１の捕捉エレメントを含む第１の固体支持体へ曝露し、第１のタグが第１の捕捉エレメントと会合することを可能にすること；（ｃ）第１の固体支持体と会合しない混合物の任意の構成要素を除去すること；（ｄ）第２のタグをアプタマー親和性複合体のバイオマーカー構成要素へ結合させること；（ｅ）アプタマー親和性複合体を第１の固体支持体から放出させこと；（ｆ）放出されたアプタマー親和性複合体を第２の捕捉エレメントを含む第２の固体支持体へ曝露し、第２のタグが第２の捕捉エレメントと会合することを可能にすること；（ｇ）非複合体化アプタマーをアプタマー親和性複合体から区分することによって、任意の非複合体化アプタマーを混合物から除去すること；（ｈ）固体支持体からアプタマーを溶出すること；及び（ｉ）アプタマー親和性複合体のアプタマー構成要素の検出によって、バイオマーカーを検出することを含む。An exemplary solution-based aptamer assay that can be used to detect a biomarker value corresponding to a biomarker in a biological sample involves the following steps: (a) preparing a mixture by contacting the biological sample with an aptamer that includes a first tag and has specific affinity for the biomarker, such that if the biomarker is present in the sample, an aptamer affinity complex is formed; (b) exposing the mixture to a first solid support that includes a first capture element, allowing the first tag to associate with the first capture element; (c) removing any components of the mixture that do not associate with the first solid support; (d) removing any components of the mixture that do not associate with the second solid support. (e) binding the tag to the biomarker component of the aptamer affinity complex; (e) releasing the aptamer affinity complex from the first solid support; (f) exposing the released aptamer affinity complex to a second solid support comprising a second capture element, allowing the second tag to associate with the second capture element; (g) removing any uncomplexed aptamer from the mixture by partitioning the uncomplexed aptamer from the aptamer affinity complex; (h) eluting the aptamer from the solid support; and (i) detecting the biomarker by detecting the aptamer component of the aptamer affinity complex.

当技術分野において公知の任意の手段が、アプタマー親和性複合体のアプタマー構成要素の検出によって、バイオマーカー値を検出するために使用され得る。多数の異なる検出方法が、親和性複合体のアプタマー構成要素を検出するために使用され得、例えばハイブリダイゼーションアッセイ、質量分光法、またはＱＰＣＲ等である。いくつかの実施形態において、核酸シーケンシング方法は、アプタマー親和性複合体のアプタマー構成要素を検出し、それによってバイオマーカー値を検出するために使用され得る。簡潔には、検査サンプルは、検査サンプル中に存在する１つまたは複数のアプタマーの配列（複数可）を同定及び定量するために、任意の種類の核酸シーケンシング方法へ供され得る。いくつかの実施形態において、配列は、アプタマー分子全体、または分子を一意的に同定するために使用され得る分子の任意の部分を含む。他の実施形態において、識別配列は、アプタマーへ添加される特異的配列であり、かかる配列は多くの場合、「タグ」、「バーコード」、または「ジップコード」と称される。いくつかの実施形態において、シーケンシング方法は、アプタマー配列を増幅するため、または任意の種類の核酸（任意の位置への化学物質修飾を含有するＲＮＡ及びＤＮＡが挙げられる）をシーケンシングのために適切な他の種類の核酸へ転換するための酵素ステップを含む。Any means known in the art can be used to detect biomarker values by detecting the aptamer component of an aptamer affinity complex. Many different detection methods can be used to detect the aptamer component of an affinity complex, such as hybridization assays, mass spectroscopy, or QPCR. In some embodiments, nucleic acid sequencing methods can be used to detect the aptamer component of an aptamer affinity complex and thereby detect biomarker values. Briefly, a test sample can be subjected to any type of nucleic acid sequencing method to identify and quantify one or more aptamer sequence(s) present in the test sample. In some embodiments, the sequence comprises the entire aptamer molecule or any portion of the molecule that can be used to uniquely identify the molecule. In other embodiments, an identification sequence is a specific sequence added to the aptamer; such sequences are often referred to as "tags," "barcodes," or "zip codes." In some embodiments, the sequencing method includes an enzymatic step to amplify the aptamer sequence or to convert any type of nucleic acid (including RNA and DNA containing chemical modifications at any position) into other types of nucleic acid suitable for sequencing.

いくつかの実施形態において、シーケンシング方法は、１つまたは複数のクローニングステップを含む。他の実施形態において、シーケンシング方法は、直接シーケンシング法を包含する。In some embodiments, the sequencing method includes one or more cloning steps. In other embodiments, the sequencing method includes direct sequencing.

いくつかの実施形態において、シーケンシング方法は、検査サンプル中の１つまたは複数のアプタマーを標的化する特異的プライマーによる指向性のアプローチを包含する。他の実施形態において、シーケンシング方法は、検査サンプル中のすべてアプタマーを標的化するショットガンアプローチを包含する。In some embodiments, the sequencing method involves a directed approach using specific primers that target one or more aptamers in the test sample. In other embodiments, the sequencing method involves a shotgun approach that targets all aptamers in the test sample.

いくつかの実施形態において、シーケンシング方法は、シーケンシングのために標的化された分子を増幅するための酵素ステップを含む。他の実施形態において、シーケンシング方法は、単一分子を直接シーケンスする。生物学的サンプル中のバイオマーカーに対応するバイオマーカー値を検出するために使用され得る例示的な核酸シーケンシングベースの方法は、以下の：（ａ）化学修飾ヌクレオチドを含有するアプタマーの混合物を非修飾核酸へ酵素ステップにより変換すること；（ｂ）もたらされた非修飾核酸を、超並列シーケンシングプラットフォーム（４例えば５４シーケンシングシステム（４５４ＬｉｆｅＳｃｉｅｎｃｅｓ／Ｒｏｃｈｅ）、Ｉｌｌｕｍｉｎａシーケンシングシステム（Ｉｌｌｕｍｉｎａ）、ＡＢＩＳＯＬｉＤシーケンシングシステム（ＡｐｐｌｉｅｄＢｉｏｓｙｓｔｅｍｓ）、ＨｅｌｉＳｃｏｐｅ単一分子シークエンサー（ＨｅｌｉｃｏｓＢｉｏｓｃｉｅｎｃｅｓ）、またはＰａｃｉｆｉｃＢｉｏｓｃｉｅｎｃｅｓリアルタイム単一分子シーケンシングシステム（ＰａｃｉｆｉｃＢｉｏＳｃｉｅｎｃｅｓ）、またはＰｏｌｏｎａｔｏｒＧシーケンシングシステム（ＤｏｖｅｒＳｙｓｔｅｍｓ）等）によりショットガンシーケンシングすること；ならびに（ｃ）特異的配列及び配列カウントによって混合物中に存在するＳＯＭＡｍｅｒを同定及び定量することを含む。In some embodiments, the sequencing method includes an enzymatic step to amplify the targeted molecule for sequencing. In other embodiments, the sequencing method directly sequences a single molecule. An exemplary nucleic acid sequencing-based method that can be used to detect biomarker values corresponding to biomarkers in a biological sample involves: (a) converting a mixture of aptamers containing chemically modified nucleotides into unmodified nucleic acids by an enzymatic step; (b) sequencing the resulting unmodified nucleic acids on a massively parallel sequencing platform (e.g., 454 Sequencing System (454 Life Sciences/Roche), Illumina Sequencing System (Illumina), ABI SOLiD Sequencing System (Applied Biosystems), HeliScope Single Molecule Sequencer (Helicos Biosciences), or Pacific Biosciences Real-Time Single Molecule Sequencing System (Pacific BioSciences), or Polonator G Sequencing System (Dover). (c) identifying and quantifying SOMAmers present in the mixture by specific sequences and sequence counts; and (d) performing shotgun sequencing using a PCR amplification system (e.g., PCR Amplification Systems).

免疫アッセイを使用するバイオマーカー値の決定
免疫アッセイ法は、その対応する標的または分析物への抗体の反応に基づいており、特異的アッセイフォーマットに依存してサンプル中の分析物を検出し得る。免疫反応性に基づくアッセイ方法の特異度及び感度を改善するために、モノクローナル抗体は、それらの特異的エピトープ認識のために、多くの場合使用される。モノクローナル抗体に比較して、標的への親和性が増加するので、ポリクローナル抗体も様々な免疫アッセイにおいて成功して使用されてきた。免疫アッセイは、広範囲の生物学的サンプルマトリックスによる使用のためにデザインされている。免疫アッセイフォーマットは、定性的、半定量的、及び定量的な結果を提供するようにデザインされている。 Determining Biomarker Values Using Immunoassays Immunoassays are based on the reaction of antibodies with their corresponding targets or analytes and can detect the analyte in a sample depending on the specific assay format. To improve the specificity and sensitivity of immunoreactivity-based assay methods, monoclonal antibodies are often used due to their specific epitope recognition. Polyclonal antibodies have also been successfully used in various immunoassays due to their increased affinity to the target compared to monoclonal antibodies. Immunoassays are designed for use with a wide range of biological sample matrices. Immunoassay formats are designed to provide qualitative, semi-quantitative, and quantitative results.

定量的結果は、検出されるべき特異的分析物の公知の濃度により生成されたスタンダードカーブの使用を介して生成される。未知のサンプルからの応答またはシグナルは、スタンダードカーブの上へプロットされ、未知のサンプル中の標的に対応する量または値が確立される。Quantitative results are generated through the use of a standard curve generated with known concentrations of the specific analyte to be detected. The response or signal from an unknown sample is plotted onto the standard curve, and the amount or value corresponding to the target in the unknown sample is established.

多数の免疫アッセイフォーマットがデザインされてきた。ＥＬＩＳＡまたはＥＩＡは、分析物の検出について定量的であり得る。この方法は、分析物または抗体のいずれかへの標識の結合に依存し、標識構成要素は、酵素を直接的または間接的のいずれかで含む。ＥＬＩＳＡ検査は、分析物を、直接的に、間接的に、競合的に、またはサンドイッチにより検出するためにフォーマットされ得る。他の方法は、標識（例えば放射性同位体（Ｉ１２５）または蛍光等）に依存するものである。追加の技法としては、例えば凝集、ネフェロメトリー、濁度測定、ウエスタンブロット、免疫沈降、免疫細胞化学、免疫組織化学、フローサイトメトリー、Ｌｕｍｉｎｅｘアッセイ、及び他が挙げられる（ＩｍｍｕｎｏＡｓｓａｙ：ＡＰｒａｃｔｉｃａｌＧｕｉｄｅ、ＢｒｉａｎＬａｗ編、Ｔａｙｌｏｒ＆Ｆｒａｎｃｉｓ，Ｌｔｄ．によって出版、２００５年版を参照）。Numerous immunoassay formats have been designed. ELISA or EIA can be quantitative for the detection of an analyte. The method relies on the attachment of a label to either the analyte or the antibody, with the label component comprising an enzyme, either directly or indirectly. ELISA tests can be formatted for direct, indirect, competitive, or sandwich detection of the analyte. Other methods rely on labels (e.g., radioisotopes (I125) or fluorescence). Additional techniques include agglutination, nephelometry, turbidity, Western blot, immunoprecipitation, immunocytochemistry, immunohistochemistry, flow cytometry, Luminex assays, and others (see ImmunoAssay: A Practical Guide, edited by Brian Law, published by Taylor & Francis, Ltd., 2005 edition).

例示的なアッセイフォーマットとしては、酵素結合免疫吸着アッセイ（ＥＬＩＳＡ）、放射免疫アッセイ、蛍光免疫アッセイ、化学発光免疫アッセイ、及び蛍光共鳴エネルギー転移（ＦＲＥＴ）免疫アッセイまたは時間分解ＦＲＥＴ（ＴＲ－ＦＲＥＴ）免疫アッセイが挙げられる。バイオマーカーを検出するための手順の例としては、バイオマーカー免疫沈降、それに続く、ゲル電気泳動法、キャピラリー電気泳動、平面エレクトロクロマトグラフィー、及び同種のもの等のサイズ及びペプチドレベルの弁別を可能にする定量的方法が挙げられる。Exemplary assay formats include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays, fluorescent immunoassays, chemiluminescent immunoassays, and fluorescence resonance energy transfer (FRET) immunoassays or time-resolved FRET (TR-FRET) immunoassays. Exemplary procedures for detecting biomarkers include biomarker immunoprecipitation followed by quantitative methods that allow for size- and peptide-level discrimination, such as gel electrophoresis, capillary electrophoresis, planar electrochromatography, and the like.

検出可能な標識またはシグナル生成物質を検出及び／または定量する方法は、標識の性質に依存する。適切な酵素によって触媒される反応の産物（検出可能な標識が酵素である場合；上を参照）は、限定されずに、蛍光性、発光性、もしくは放射性であり得るか、または可視光もしくは紫外線を吸収し得る。かかる検出可能な標識を検出するのに好適な検出器の例としては、Ｘ線フィルム、放射能カウンター、シンチレーションカウンター、分光光度計、比色計、蛍光測定器、ルミノメーター、及び濃度計が限定されずに挙げられる。Methods for detecting and/or quantifying a detectable label or signal-producing substance depend on the nature of the label. The product of the reaction catalyzed by an appropriate enzyme (when the detectable label is an enzyme; see above) can be, without limitation, fluorescent, luminescent, or radioactive, or can absorb visible or ultraviolet light. Examples of detectors suitable for detecting such detectable labels include, without limitation, X-ray film, radioactivity counters, scintillation counters, spectrophotometers, colorimeters, fluorometers, luminometers, and densitometers.

検出のための方法のうちの任意のものは、反応物の任意の好適な調製、プロセシング、及び分析を可能にする任意のフォーマットにおいて遂行され得る。これは、例えばマルチウェルアッセイプレート（例えば９６ウェルまたは３８４ウェル）で、または任意の好適なアレイもしくはマイクロアレイも使用して、行われ得る。様々な薬剤のためのストック溶液は、手動でまたはロボットにより作製され得、後続するすべてのピペッティング、希釈、混合、分布、洗浄、インキュベーション、サンプル読み取り、データ収集、及び分析は、検出可能な標識を検出することが可能な商業的に入手可能な分析ソフトウェア、ロボット工学、及び検出装置類を使用して、ロボットにより行われ得る。Any of the methods for detection can be performed in any format that allows for any suitable preparation, processing, and analysis of the reactants. This can be done, for example, in multiwell assay plates (e.g., 96-well or 384-well) or using any suitable array or microarray. Stock solutions for various agents can be made manually or robotically, and all subsequent pipetting, dilution, mixing, distribution, washing, incubation, sample reading, data collection, and analysis can be performed robotically using commercially available analysis software, robotics, and detection equipment capable of detecting the detectable label.

遺伝子発現プロファイリングを使用するバイオマーカー値の決定
生物学的サンプル中のｍＲＮＡの測定は、生物学的サンプル中の対応するタンパク質のレベルの検出についての代用物として使用され得る。したがって、本明細書において記載されるバイオマーカーまたはバイオマーカーパネルのうちの任意のものは、適切なＲＮＡの検出によっても検出され得る。 Determining Biomarker Values Using Gene Expression Profiling Measurement of mRNA in a biological sample can be used as a surrogate for detecting the level of the corresponding protein in the biological sample. Thus, any of the biomarkers or biomarker panels described herein can also be detected by detection of the appropriate RNA.

ｍＲＮＡ発現レベルは、逆転写定量的ポリメラーゼ連鎖反応（ｑＰＣＲが続くＲＴ－ＰＣＲ）によって測定される。ＲＴ－ＰＣＲは、ｍＲＮＡからｃＤＮＡを生成するために使用される。ｃＤＮＡは、ＤＮＡ増幅プロセスが進行するにつれて蛍光を産生するように、ｑＰＣＲアッセイにおいて使用され得る。スタンダードカーブへの比較によって、ｑＰＣＲは、絶対測定値（１つの細胞あたりｍＲＮＡのコピーの数等）を生じ得る。ノーザンブロット、マイクロアレイ、インベーダーアッセイ、及びキャピラリー電気泳動と組み合わせたＲＴ－ＰＣＲは、サンプル中のｍＲＮＡの発現レベルを測定するためにすべて使用されている。ＧｅｎｅＥｘｐｒｅｓｓｉｏｎＰｒｏｆｉｌｉｎｇ：ＭｅｔｈｏｄｓａｎｄＰｒｏｔｏｃｏｌｓ，ＲｉｃｈａｒｄＡ．Ｓｈｉｍｋｅｔｓ，ｅｄｉｔｏｒ，ＨｕｍａｎａＰｒｅｓｓ，２００４を参照されたい。mRNA expression levels are measured by reverse transcription quantitative polymerase chain reaction (RT-PCR followed by qPCR). RT-PCR is used to generate cDNA from mRNA. The cDNA can be used in a qPCR assay to produce fluorescence as the DNA amplification process progresses. By comparison to a standard curve, qPCR can generate absolute measurements (such as the number of mRNA copies per cell). Northern blots, microarrays, Invader assays, and RT-PCR combined with capillary electrophoresis have all been used to measure mRNA expression levels in samples. See Gene Expression Profiling: Methods and Protocols, Richard A. Shimkets, editor, Humana Press, 2004.

ｍｉＲＮＡ分子は、非コーディングであるが、遺伝子発現を調節し得る小分子ＲＮＡである。ｍＲＮＡ発現レベルの測定値に適した方法のうちの任意のものは、対応するｍｉＲＮＡのためにも使用され得る。最近、多くの検査室が、疾患についてのバイオマーカーとしてｍｉＲＮＡの使用を調査している。多くの疾患が広範囲の転写調節を伴い、ｍｉＲＮＡにバイオマーカーとしての役割が見出され得るのは驚くべきことではない。ｍｉＲＮＡ濃度と疾患との間の関係は、多くの場合、タンパク質レベルと疾患との間の関係ほど明らかではないにもかかわらず、ｍｉＲＮＡバイオマーカーの価値は実質的であり得る。当然のことながら、疾患の最中に差次的に発現される任意のＲＮＡでのように、インビトロの診断製品の開発が直面する問題としては、ｍｉＲＮＡが疾患細胞中で残存し、分析のために容易に抽出されるいう要件、またはｍｉＲＮＡが血液もしくは他のマトリックスの中へ放出され、そこで、測定されるために十分に長く残存しなくてはならないという要件が挙げられる。タンパク質バイオマーカーには、類似する要件があるが、多くの可能性のあるタンパク質バイオマーカーは、疾患の最中に病理及び機能の部位でパラクライン様式で意図的に分泌される。多くの可能性のあるタンパク質バイオマーカーは、それらのタンパク質が合成される細胞の外で機能するようにデザインされる。MiRNA molecules are small RNA molecules that are non-coding but can regulate gene expression. Any method suitable for measuring mRNA expression levels can also be used for the corresponding miRNA. Recently, many laboratories have been investigating the use of miRNAs as biomarkers for disease. Many diseases involve widespread transcriptional regulation, so it is not surprising that miRNAs may find a role as biomarkers. Although the relationship between miRNA concentrations and disease is often less clear than the relationship between protein levels and disease, the value of miRNA biomarkers can be substantial. Of course, as with any RNA that is differentially expressed during disease, challenges facing the development of in vitro diagnostic products include the requirement that the miRNA persist in diseased cells and be easily extracted for analysis, or that the miRNA be released into blood or other matrices, where it must remain long enough to be measured. Protein biomarkers have similar requirements, although many potential protein biomarkers are intentionally secreted in a paracrine manner at sites of pathology and function during disease. Many potential protein biomarkers are designed to function outside the cells in which the proteins are synthesized.

インビボ分子イメージングテクノロジーを使用するバイオマーカーの検出
記載されるバイオマーカー（例えば表４を参照）のうちの任意のものが、分子イメージング検査において使用され得る。例えば、造影剤は、記載されるバイオマーカーのうちの任意のものへカップリングされ、他の使用の中でも、タバコ使用状態の査定を支援するため、治療介入への応答をモニタリングするため、臨床試験のための集団を選択するために使用され得る。 Detection of Biomarkers Using In Vivo Molecular Imaging Technologies Any of the described biomarkers (see, e.g., Table 4) can be used in molecular imaging studies. For example, imaging agents can be coupled to any of the described biomarkers and used to aid in the assessment of tobacco use status, monitor response to therapeutic interventions, and select populations for clinical trials, among other uses.

インビボイメージング技術は、個体の身体中の特定の疾患または病態の状態を決定するための非侵襲性方法を提供する。例えば、身体の部分の全体または身体の全体でさえ、三次元イメージとして観察され、それによって、身体における形態及び構造に関する有益な情報が提供される。かかる技術は、個体のタバコ使用状態に関する情報を提供するために本明細書において記載されるバイオマーカーの検出と組み合わせられ得る。In vivo imaging techniques provide a non-invasive method for determining the status of a particular disease or condition in an individual's body. For example, entire body parts or even the entire body can be viewed as three-dimensional images, thereby providing useful information about the morphology and structure of the body. Such techniques can be combined with the detection of biomarkers described herein to provide information about an individual's tobacco use status.

インビボの分子イメージング技術の使用は、技術の様々な進歩に起因して拡大している。これらの進歩としては、身体内で強いシグナルを提供し得る新規のコントラスト剤または標識（放射性同位体標識及び／または蛍光標識等）の開発；ならびに有用な情報を提供するのに十分な感度及び精密度により身体の外からのこれらのシグナルを検出及び分析し得る強力な新規のイメージング技術の開発が挙げられる。コントラスト剤は、適切なイメージングシステムにおいて可視化され得、それによって、コントラスト剤が所在する身体の部分（複数可）のイメージを提供する。コントラスト剤は、捕捉試薬（例えばアプタマーまたは抗体等）、ならびに／あるいはペプチドもしくはタンパク質、またはオリゴヌクレオチド（例えば遺伝子発現の検出のため）、または１つもしくは複数のマクロ分子及び／もしくは他の微粒子形態と共にこれらのうちの任意のものを含有する複合体へ結合され得るか、またはそれらと会合し得る。The use of in vivo molecular imaging techniques is expanding due to various advances in technology. These advances include the development of new contrast agents or labels (such as radioisotope and/or fluorescent labels) that can provide strong signals within the body; and the development of powerful new imaging techniques that can detect and analyze these signals from outside the body with sufficient sensitivity and precision to provide useful information. Contrast agents can be visualized in an appropriate imaging system, thereby providing an image of the body part(s) in which they are located. Contrast agents can be bound to or associated with capture agents (such as aptamers or antibodies), and/or peptides or proteins, or oligonucleotides (e.g., for detecting gene expression), or complexes containing any of these along with one or more macromolecules and/or other particulate forms.

コントラスト剤は、またイメージングにおいて有用な放射性原子を特色とし得る。好適な放射性原子としては、シンチグラフ研究のための、テクネチウム－９９ｍまたはヨウ素－１２３が挙げられる。他の容易に検出可能な部分としては、例えば磁気共鳴イメージング（ＭＲＩ）のためのスピン標識（例えば繰り返すがヨウ素－１２３、ヨウ素－１３１、インジウム－１１１、フッ素－１９、炭素－１３、窒素－１５、酸素－１７、ガドリニウム、マンガン、または鉄等）が挙げられる。かかる標識は、当技術分野において周知であり、当業者によって容易に選択され得る。Contrast agents may also feature radioactive atoms useful in imaging. Suitable radioactive atoms include technetium-99m or iodine-123 for scintigraphic studies. Other readily detectable moieties include spin labels for magnetic resonance imaging (MRI) (e.g., again, iodine-123, iodine-131, indium-111, fluorine-19, carbon-13, nitrogen-15, oxygen-17, gadolinium, manganese, or iron). Such labels are well known in the art and can be readily selected by one of ordinary skill in the art.

標準的なイメージング技法としては、磁気共鳴イメージング、コンピューター断層撮影スキャン（冠動脈カルシウムスコア）、陽電子放出断層撮影（ＰＥＴ）、単一光子放出コンピューター断層撮影（ＳＰＥＣＴ）、コンピューター断層撮影血管造影、及び同種のものが挙げられるがこれらに限定されない。診断用のインビボイメージングについては、利用可能な検出装置のタイプは、所与のコントラスト剤（所与の放射性核種等）及びそれを使用して標的化される特定のバイオマーカー（タンパク質、ｍＲＮＡ、及び同種のもの）の選択において主要な因子である。選ばれた放射性核種は、典型的には、所与のタイプの装置によって検出可能なタイプの崩壊を有する。またインビボの診断のために放射性核種を選択する場合に、その半減期は、標的組織による最大の取り込みの時間での検出を可能にするほどに十分に長いが、宿主への有害な放射線が最小限にされるほどに十分に短くあるべきである。Standard imaging techniques include, but are not limited to, magnetic resonance imaging, computed tomography scans (coronary artery calcium scoring), positron emission tomography (PET), single-photon emission computed tomography (SPECT), computed tomography angiography, and the like. For diagnostic in vivo imaging, the type of detection instrument available is a major factor in the selection of a given contrast agent (such as a given radionuclide) and the particular biomarker (protein, mRNA, and the like) to be targeted with it. The radionuclide chosen typically has a type of decay that is detectable by a given type of instrument. Also, when selecting a radionuclide for in vivo diagnosis, its half-life should be long enough to allow detection at the time of maximum uptake by the target tissue, yet short enough so that harmful radiation to the host is minimized.

例示的なイメージング技法としては、ＰＥＴ及びＳＰＥＣＴが挙げられるがこれらに限定されず、これらは、放射性核種が合成的にまたは局所的に個体へ投与されるイメージング技法である。放射性トレーサーの後続する取り込みは経時的に測定され、標的化された組織及びバイオマーカーについての情報を得るために使用される。用いられる特定の同位体が高エネルギー（ガンマ線）で放射し、それらを検出するために使用される装置が感度があり洗練されているので、放射能の二次元分布は身体の外側から推測され得る。Exemplary imaging techniques include, but are not limited to, PET and SPECT, which are imaging techniques in which a radionuclide is administered synthetically or locally to an individual. The subsequent uptake of the radiotracer is measured over time and used to obtain information about the targeted tissue and biomarkers. Because the particular isotopes used emit with high energy (gamma rays) and the equipment used to detect them is sensitive and sophisticated, the two-dimensional distribution of radioactivity can be inferred from outside the body.

ＰＥＴにおいて一般的に使用される陽電子放出としては、例えば炭素－１１、窒素－１３、酸素－１５、及びフッ素－１８が挙げられる。電子捕捉及び／またはガンマ放射によって崩壊する同位体がＳＰＥＣＴにおいて使用され、当該同位体としては、例えばヨウ素－１２３及びテクネチウム－９９ｍが挙げられる。テクネチウム－９９ｍによりアミノ酸を標識する例示的な方法は、キレート前駆体の存在下において過テクネチウム酸イオンを還元して、不安定なテクネチウム９９ｍ前駆体複合体を形成し、これは、今度は、二官能性に修飾された走化性ペプチドの金属結合基と反応して、テクネチウム９９ｍ走化性ペプチドコンジュゲートを形成する。Positron-emitting isotopes commonly used in PET include, for example, carbon-11, nitrogen-13, oxygen-15, and fluorine-18. Isotopes that decay by electron capture and/or gamma emission are used in SPECT, including, for example, iodine-123 and technetium-99m. An exemplary method for labeling amino acids with technetium-99m involves reducing pertechnetate ions in the presence of a chelate precursor to form an unstable technetium-99m precursor complex, which in turn reacts with the metal-binding group of a bifunctionally modified chemotactic peptide to form a technetium-99m chemotactic peptide conjugate.

抗体は、かかるインビボのイメージング診断方法のために頻繁に使用される。インビボの診断のための抗体の調製及び使用は、当技術分野において周知である。表４中のバイオマーカーのうちの任意のものに特異的に結合する標識抗体は、個体の疾患状態または病態の診断または評価の目的のために、使用される特定のバイオマーカーに従って検出可能な個体の中へ注射され得る。使用される標識は、以前に記載されるように、使用されるイメージング様式に従って選択される。標識の局在化は、個体がこれまでにタバコ使用者であった確率に関連する組織損傷または他の指標の決定を可能にする。器官または組織内の標識の量は、その器官または組織において「これまでに」タバコ使用者であることに起因するバイオマーカーの関与の決定も可能にする。Antibodies are frequently used for such in vivo imaging diagnostic methods. The preparation and use of antibodies for in vivo diagnostics is well known in the art. Labeled antibodies that specifically bind to any of the biomarkers in Table 4 can be injected into an individual, detectable according to the particular biomarker used, for purposes of diagnosing or assessing the individual's disease state or condition. The label used is selected according to the imaging modality used, as previously described. Localization of the label allows for the determination of tissue damage or other indicators related to the probability that the individual was a former tobacco user. The amount of label within an organ or tissue also allows for the determination of the contribution of biomarkers attributable to "ever" tobacco users in that organ or tissue.

同様に、アプタマーは、かかるインビボイメージング診断方法のために使用され得る。例えば、表４中で記載される特定のバイオマーカーを同定するために使用される（したがって、その特定のバイオマーカーへ特異的に結合する）アプタマーは、個体におけるタバコ使用状態と関連する組織損傷、アテローム性プラーク、炎症応答の構成要素、及び他の因子のレベルの診断または評価の目的のために、適切に標識され得、特定のバイオマーカーに従って検出可能なタバコ使用状態の決定のために評価されている個体の中へ注射され得る。使用される標識は、以前に記載されるように、使用されるイメージング様式に従って選択される。標識の局在化は、リスクの増加を導くプロセスのある部位の決定を可能にする。器官または組織内の標識の量は、その器官または組織における病理学的なプロセスの浸潤の決定も可能にする。アプタマー指向性の造影剤は、他の造影剤に比較して、組織透過性、組織分布、動態、排出、効力、及び選択性に関連する固有且つ有利な特徴を有し得る。Similarly, aptamers can be used for such in vivo imaging diagnostic methods. For example, aptamers used to identify (and therefore specifically bind to) specific biomarkers described in Table 4 can be appropriately labeled and injected into an individual being evaluated for the determination of detectable tobacco use status according to the specific biomarker for purposes of diagnosis or assessment of levels of tissue damage, atherosclerotic plaque, components of the inflammatory response, and other factors associated with tobacco use status in the individual. The label used is selected according to the imaging modality to be used, as previously described. Localization of the label allows for the determination of the site of processes leading to increased risk. The amount of label within an organ or tissue also allows for the determination of the infiltration of pathological processes in that organ or tissue. Aptamer-directed imaging agents can have unique and advantageous characteristics related to tissue permeability, tissue distribution, kinetics, excretion, efficacy, and selectivity compared to other imaging agents.

かかる技法は、任意選択で、例えばアンチセンスオリゴヌクレオチドによるイメージングを介する遺伝子発現の検出のために、標識されたオリゴヌクレオチドでも遂行され得る。これらの方法は、例えば標識として、蛍光分子または放射性核種によるインサイチューハイブリダイゼーションのために使用される。遺伝子発現の検出のための他の方法としては、例えばレポーター遺伝子の活性の検出が挙げられる。Such techniques can also optionally be performed with labeled oligonucleotides, for example, for detecting gene expression via imaging with antisense oligonucleotides. These methods are used, for example, for in situ hybridization with fluorescent molecules or radionuclides as labels. Other methods for detecting gene expression include, for example, detecting the activity of reporter genes.

別の一般的なタイプのイメージング技術は、対象内の蛍光シグナルが対象の外側にある光学デバイスによって検出される光学イメージングである。これらのシグナルは、実際の蛍光及び／または生物発光に起因し得る。光検出デバイスの感度の改善により、インビボ診断アッセイのための光学イメージングの有用性を増加した。Another common type of imaging technique is optical imaging, in which fluorescent signals within a subject are detected by an optical device outside the subject. These signals can result from actual fluorescence and/or bioluminescence. Improvements in the sensitivity of optical detection devices have increased the utility of optical imaging for in vivo diagnostic assays.

インビボ分子バイオマーカーイメージングの使用は、臨床試験のためのものを包含し、例えば疾患もしくは病態の新規の治療法のための試験において臨床的有効性をより迅速に測定するために、及び／またはプラセボによる長期の治療が倫理的に疑わしいと判断され得る疾患（多発性硬化症等）についてのかかる長期の治療を回避するために、増加している。The use of in vivo molecular biomarker imaging is increasing, including for clinical trials, to more rapidly measure clinical efficacy in testing new treatments for diseases or conditions, and/or to avoid long-term placebo treatment for diseases where such treatment may be deemed ethically questionable (e.g., multiple sclerosis).

他の技法の総説については、Ｎ．Ｂｌｏｗ，ＮａｔｕｒｅＭｅｔｈｏｄｓ，６，４６５－４６９，２００９を参照されたい。For a review of other techniques, see N. Blow, Nature Methods, 6, 465-469, 2009.

質量分析法方法を使用するバイオマーカー値の決定
様々な構成の質量分析計は、バイオマーカー値を検出するために使用され得る。複数のタイプの質量分析計は利用可能であるか、または様々な構成で製造され得る。概して、質量分析計は、以下の主要な構成要素：サンプル注入口、イオン源、質量分析器、検出器、真空システム、及び装置制御システム、及びデータシステムを有する。サンプル注入口、イオン源、及び質量分析器における差は、概して、装置のタイプ及びその能力を定義する。例えば、注入口は、キャピラリーカラム液体クロマトグラフィー源であり得るか、またはマトリックス支援レーザー脱離等において使用されるダイレクトプローブもしくはステージであり得る。一般的なイオン源は、例えばエレクトロスプレー（ナノスプレー及びマイクロスプレーが挙げられる）またはマトリックス支援レーザー脱離である。一般的な質量分析器としては、四重極質量フィルター、イオントラップ質量分析器、及び飛行時間型質量分析器が挙げられる。追加の質量分析法方法は、当技術分野において周知である（Ｂｕｒｌｉｎｇａｍｅｅｔａｌ．Ａｎａｌ．Ｃｈｅｍ．７０：６４７Ｒ－７１６Ｒ（１９９８）；ＫｉｎｔｅｒａｎｄＳｈｅｒｍａｎ，ＮｅｗＹｏｒｋ（２０００）を参照）。 Determining Biomarker Values Using Mass Spectrometric Methods Mass spectrometers of various configurations can be used to detect biomarker values. Multiple types of mass spectrometers are available or can be manufactured in various configurations. Generally, a mass spectrometer has the following major components: a sample inlet, an ion source, a mass analyzer, a detector, a vacuum system, and an instrument control system and a data system. The differences in the sample inlet, ion source, and mass analyzer generally define the type of instrument and its capabilities. For example, the inlet can be a capillary column liquid chromatography source or a direct probe or stage used in matrix-assisted laser desorption, etc. Common ion sources are, for example, electrospray (including nanospray and microspray) or matrix-assisted laser desorption. Common mass analyzers include quadrupole mass filters, ion trap mass analyzers, and time-of-flight mass analyzers. Additional mass spectrometry methods are well known in the art (see Burlingame et al. Anal. Chem. 70:647 R-716R (1998); Kinter and Sherman, New York (2000)).

タンパク質バイオマーカー及びバイオマーカー値は、以下の：エレクトロスプレーイオン化質量分析（ＥＳＩ－ＭＳ）、ＥＳＩ－ＭＳ／ＭＳ、ＥＳＩ－ＭＳ／（ＭＳ）ｎ、マトリックス支援レーザー脱離イオン化飛行時間型質量分析（ＭＡＬＤＩ－ＴＯＦ－ＭＳ）、表面増強レーザー脱離／イオン化飛行時間型質量分析（ＳＥＬＤＩ－ＴＯＦ－ＭＳ）、シリコン上脱離／イオン化（ＤＩＯＳ）、二次イオン質量分析（ＳＩＭＳ）、四極子－飛行時間（Ｑ－ＴＯＦ）、ｕｌｔｒａｆｌｅｘＩＩＩＴＯＦ／ＴＯＦと呼ばれるタンデム飛行時間型（ＴＯＦ／ＴＯＦ）技術、大気圧化学イオン化質量分析（ＡＰＣＩ－ＭＳ）、ＡＰＣＩ－ＭＳ／ＭＳ、ＡＰＣＩ－（ＭＳ）Ｎ、大気圧光イオン化質量分析（ＡＰＰＩ－ＭＳ）、ＡＰＰＩ－ＭＳ／ＭＳ、及びＡＰＰＩ－（ＭＳ）Ｎ、四極子質量分析、フーリエ変換質量分析（ＦＴＭＳ）、定量的質量分析、ならびにイオントラップ質量分析のうちの任意のものによって検出及び測定され得る。Protein biomarkers and biomarker values were analyzed using the following methods: electrospray ionization mass spectrometry (ESI-MS), ESI-MS/MS, ESI-MS/(MS)n, matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF-MS), surface-enhanced laser desorption/ionization time-of-flight mass spectrometry (SELDI-TOF-MS), silicon-on-silicon desorption/ionization (DIOS), secondary ion mass spectrometry (SIMS), quadrupole-time-of-flight (Q-TOF), and ultraflex III. They can be detected and measured by any of the following tandem time-of-flight (TOF/TOF) techniques: TOF/TOF, atmospheric pressure chemical ionization mass spectrometry (APCI-MS), APCI-MS/MS, APCI-(MS)N, atmospheric pressure photoionization mass spectrometry (APPI-MS), APPI-MS/MS, and APPI-(MS)N, quadrupole mass spectrometry, Fourier transform mass spectrometry (FTMS), quantitative mass spectrometry, and ion trap mass spectrometry.

サンプル調製戦略は、タンパク質バイオマーカーの質量分析による特徴づけ及びバイオマーカー値の決定の前に、サンプルを標識及びエンリッチするために使用される。標識方法としては、相対量及び絶対量測定のためのアイソバリックタグ（ｉＴＲＡＱ）及び細胞培養におけるアミノ酸による安定同位体標識（ＳＩＬＡＣ）が挙げられるがこれらに限定されない。質量分析による分析の前の候補バイオマーカータンパク質のためのサンプルを選択的にエンリッチするために使用される捕捉試薬としては、アプタマー、抗体、核酸プローブ、キメラ、小分子、Ｆ（ａｂ’）_２断片、一本鎖抗体断片、Ｆｖ断片、一本鎖Ｆｖ断片、核酸、レクチン、リガンド結合受容体、アフィボディ、ナノボディ、アンキリン、ドメイン抗体、代替抗体スキャフォールド（例えばダイアボディなど）インプリントポリマー、アビマー、ペプチド模倣体、ペプトイド、ペプチド核酸、トレオース核酸、ホルモン受容体、サイトカイン受容体、及び合成の受容体、ならびにこれらの修飾物及び断片が挙げられるがこれらに限定されない。 Sample preparation strategies are used to label and enrich samples prior to mass spectrometric characterization of protein biomarkers and determination of biomarker values. Labeling methods include, but are not limited to, isobaric tagging for relative and absolute quantification (iTRAQ) and stable isotope labeling with amino acids in cell culture (SILAC). Capture reagents used to selectively enrich samples for candidate biomarker proteins prior to mass spectrometric analysis include, but are not limited to, aptamers, antibodies, nucleic acid probes, chimeras, small molecules, F(ab')₂ fragments, single-chain antibody fragments, Fv fragments, single-chain Fv fragments, nucleic acids, lectins, ligand-binding receptors, affibodies, nanobodies, ankyrins, domain antibodies, surrogate antibody scaffolds (e.g., diabodies), imprinted polymers, avimers, peptidomimetics, peptoids, peptide nucleic acids, threose nucleic acids, hormone receptors, cytokine receptors, and synthetic receptors, as well as modifications and fragments thereof.

近接ライゲーションアッセイを使用するバイオマーカー値の決定
近接ライゲーションアッセイは、バイオマーカー値を決定するために使用され得る。簡潔には、検査サンプルは、ペアの抗体またはペアのアプタマーであり得るペアの親和性プローブと接触させられ、当該ペアの各々のメンバーは、オリゴヌクレオチドにより延長されている。ペアの親和性プローブについての標的は、１つのタンパク質上の２つの別個の決定基、またはホモ多量体もしくはヘテロ多量体の複合体として存在して得る２つの異なるタンパク質の各々上の１つの決定基であり得る。プローブが標的決定基へ結合する場合に、オリゴヌクレオチド延長部の自由端部は、一緒にハイブリダイズするのに十分に近接させられる。オリゴヌクレオチド延長部のハイブリダイゼーションは、オリゴヌクレオチド延長部が十分に近接して位置する場合に、それらを一緒に架橋する役目を果たす共通のコネクターオリゴヌクレオチドによって助長される。一旦プローブのオリゴヌクレオチド延長部がハイブリダイズされたならば、延長部の端部は、酵素によるＤＮＡライゲーションによって一緒に繋がれる。 Determining Biomarker Values Using Proximity Ligation Assays Proximity ligation assays can be used to determine biomarker values. Briefly, a test sample is contacted with a pair of affinity probes, which can be a pair of antibodies or a pair of aptamers, with each member of the pair extended with an oligonucleotide. The targets for the pair of affinity probes can be two distinct determinants on a single protein, or one determinant on each of two different proteins, which can exist as a homo- or hetero-multimeric complex. When the probes bind to the target determinants, the free ends of the oligonucleotide extensions are brought into sufficient proximity to hybridize together. Hybridization of the oligonucleotide extensions is facilitated by a common connector oligonucleotide, which serves to bridge the oligonucleotide extensions together when they are positioned sufficiently close together. Once the oligonucleotide extensions of the probes are hybridized, the ends of the extensions are joined together by enzymatic DNA ligation.

各々のオリゴヌクレオチド延長部は、ＰＣＲ増幅のためのプライマー部位を含む。一旦オリゴヌクレオチド延長部が一緒にライゲーションされたならば、オリゴヌクレオチドは連続的なＤＮＡ配列を形成し、当該ＤＮＡ配列は、ＰＣＲ増幅を介して、標的タンパク質の同一性及び量に関する情報、そして標的決定基が２つの異なるタンパク質上にある場合のタンパク質－タンパク質相互作用に関する情報を明らかにする。近接ライゲーションは、リアルタイムＰＣＲの使用を介して、リアルタイムのタンパク質濃度及び相互作用の情報について高感度且つ特異的なアッセイを提供し得る。関心のある決定基を結合しないプローブは、対応するオリゴヌクレオチド延長部を近接させず、ライゲーションまたはＰＣＲ増幅は進行できず、産生されているシグナルをもたらさない。Each oligonucleotide extension contains a primer site for PCR amplification. Once the oligonucleotide extensions are ligated together, the oligonucleotides form a continuous DNA sequence that, through PCR amplification, reveals information about the identity and quantity of the target protein, and, if the target determinants are on two different proteins, information about protein-protein interactions. Proximity ligation can provide a highly sensitive and specific assay for real-time protein concentration and interaction information through the use of real-time PCR. Probes that do not bind the determinants of interest will not bring the corresponding oligonucleotide extension into proximity, and ligation or PCR amplification cannot proceed, resulting in no signal being produced.

前述のアッセイは、これまでにタバコ使用者であった確率を予測する方法において有用なバイオマーカーレベルの検出を可能にし、当該方法は、個体からの生物学的サンプルにおける、各々が表４中で提供されるバイオマーカーからなる群から選択されるバイオマーカーに対応するバイオマーカーレベルを検出することを含み、以下で詳細に記載されるように、バイオマーカーレベルを使用する分類は、個体がこれまでにタバコ使用者であった確率を有するかどうかを示す。記載されるバイオマーカーのうち特定のものは単独で、個体がこれまでにタバコ使用者であった確率を決定するのに有用であるが、各々が２つ以上のバイオマーカーのパネルとして有用なバイオマーカーの複数のサブセットのグループ化のための方法も本明細書において記載される。本明細書において記載される方法のうちの任意のものに従って、バイオマーカーレベルは、個別に検出及び分類され得るかまたは例えばマルチプレックスアッセイフォーマットにおいて集合的に検出及び分類され得る。The foregoing assays allow for the detection of biomarker levels useful in methods for predicting the probability of being a former tobacco user, the methods comprising detecting, in a biological sample from an individual, biomarker levels each corresponding to a biomarker selected from the group consisting of the biomarkers provided in Table 4, and classification using the biomarker levels, as described in detail below, indicates whether the individual has a probability of being a former tobacco user. While certain of the described biomarkers are useful alone in determining the probability that an individual is a former tobacco user, methods are also described herein for grouping multiple subsets of biomarkers, each useful as a panel of two or more biomarkers. According to any of the methods described herein, biomarker levels can be detected and classified individually or collectively, for example, in a multiplex assay format.

バイオマーカーの分類及び疾患スコアの計算
いくつかの実施形態において、バイオマーカー「シグネチャー」は、所与の診断検査または予測検査についてのマーカーのセットを含有し、各々のマーカーは、関心のある集団において異なるレベルを有する。異なるレベルは、この文脈において、２つ以上の群における個体についてのマーカーレベルの異なる平均、２つ以上の群における異なる分散、または両方の組み合わせを指し得る。診断検査の最も単純な形態については、これらのマーカーは、個体からの未知のサンプルを、２つの群（タバコ使用の既往歴を有するかまたは有しない等）のうちの１つの中へ割り当てるために使用され得る。サンプルを２つ以上の群の１つの中への割り当てることは、分類として公知であり、この割り当てを達成するために使用される手順は、分類子または分類方法として公知である。分類方法は、スコアリング方法とも称され得る。バイオマーカー値のセットから診断用の分類子を構築するために使用され得る多くの分類方法がある。概して、分類方法は、区別したい２つの（または多重分類状態についてはそれを超える）別個の群内の個体から得られたサンプルを使用して、データセットが収集される場合、教師あり学習技法を使用して最も容易に遂行される。各々のサンプルが属するクラス（群または集団）が各々のサンプルについて先立って知られているので、分類方法は、所望される分類応答を与えるように訓練され得る。診断用の分類子を生ずるために教師なし学習技法を使用することも可能である。 Biomarker Classification and Disease Score Calculation In some embodiments, a biomarker "signature" contains a set of markers for a given diagnostic or predictive test, each marker having different levels in a population of interest. Different levels, in this context, can refer to different means of marker levels for individuals in two or more groups, different variances in two or more groups, or a combination of both. For the simplest form of a diagnostic test, these markers can be used to assign an unknown sample from an individual into one of two groups (such as with or without a history of tobacco use). Assigning a sample into one of two or more groups is known as classification, and the procedure used to achieve this assignment is known as a classifier or classification method. Classification methods can also be referred to as scoring methods. There are many classification methods that can be used to construct a diagnostic classifier from a set of biomarker values. Generally, classification methods are most easily accomplished using supervised learning techniques when a dataset is collected using samples obtained from individuals in two (or more for multiple classification situations) distinct groups one wishes to distinguish. Since the class (group or population) to which each sample belongs is known in advance for each sample, classification methods can be trained to give the desired classification response. It is also possible to use unsupervised learning techniques to generate classifiers for diagnostic purposes.

診断用の分類子を開発するための一般的なアプローチとしては、ディシジョントリー；バギング、ブースティング、フォレスト、及びランダムフォレスト；ルール推論ベースの学習；パルツェン窓；線形モデル；ロジスティック；ニューラルネットワーク方法；教師なしクラスタリング；ｋ平均法；階層昇順／降順；半教師あり学習；プロトタイプ方法；最近傍；カーネル密度推定；サポートベクターマシン；隠れマルコフモデル；ボルツマン学習が挙げられ、分類子は、単純に組み合わせられ得るか、または特定の目的関数を最小限にする手法で組み合わせられ得る。総説については、例えばＰａｔｔｅｒｎＣｌａｓｓｉｆｉｃａｔｉｏｎ，Ｒ．Ｏ．Ｄｕｄａ，ｅｔａｌ．，ｅｄｉｔｏｒｓ，ＪｏｈｎＷｉｌｅｙ＆Ｓｏｎｓ，２ｎｄｅｄｉｔｉｏｎ，２００１を参照されたい。ＴｈｅＥｌｅｍｅｎｔｓｏｆＳｔａｔｉｓｔｉｃａｌＬｅａｒｎｉｎｇ－ＤａｔａＭｉｎｉｎｇ，Ｉｎｆｅｒｅｎｃｅ，ａｎｄＰｒｅｄｉｃｔｉｏｎ，Ｔ．Ｈａｓｔｉｅ，ｅｔａｌ．，ｅｄｉｔｏｒｓ，ＳｐｒｉｎｇｅｒＳｃｉｅｎｃｅ＋ＢｕｓｉｎｅｓｓＭｅｄｉａ，ＬＬＣ，２ｎｄｅｄｉｔｉｏｎ，２００９も参照されたい。同文献の各々は、参照することによってそれらの全体が援用される。Common approaches for developing diagnostic classifiers include decision trees; bagging, boosting, forests, and random forests; rule-based learning; Parzen windows; linear models; logistic regression; neural network methods; unsupervised clustering; k-means; hierarchical ascending/descending; semi-supervised learning; prototype methods; nearest neighbors; kernel density estimation; support vector machines; hidden Markov models; and Boltzmann learning. Classifiers can be combined simply or in a manner that minimizes a specific objective function. For a review, see, e.g., "Pattern Classification," R. O. Duda, et al., editors, John Wiley & Sons, 2nd edition, 2001. See also *The Elements of Statistical Learning - Data Mining, Inference, and Prediction*, T. Hastie, et al., editors, Springer Science+Business Media, LLC, 2nd edition, 2009, each of which is incorporated by reference in its entirety.

教師あり学習技法を使用して分類子を生ずるために、訓練データと呼ばれるサンプルのセットが得られる。診断検査の文脈において、訓練データは、未知のサンプルが後に割り当てられる別個の群（クラス）からのサンプルを含む。例えば、対照集団中の個体、及び特定の疾患、病態、または事象の集団（「これまでの」タバコ使用者（以前または現在のタバコ使用者）等）中の個体から収集されたサンプルは、未知のサンプル（またはより具体的にはサンプルが得られた個体）を、「これまでの」タバコ使用者または「１度もタバコを使用したことがない」者のいずれかであるとして分類することができる分類子を開発するために、訓練データを構成し得る。訓練データからの分類子の開発は、分類子の訓練として公知である。分類子訓練についての具体的な詳細は、教師あり学習技法の性質に依存する（例えばＰａｔｔｅｒｎＣｌａｓｓｉｆｉｃａｔｉｏｎ，Ｒ．Ｏ．Ｄｕｄａ，ｅｔａｌ．，ｅｄｉｔｏｒｓ，ＪｏｈｎＷｉｌｅｙ＆Ｓｏｎｓ，２ｎｄｅｄｉｔｉｏｎ，２００１を参照；ＴｈｅＥｌｅｍｅｎｔｓｏｆＳｔａｔｉｓｔｉｃａｌＬｅａｒｎｉｎｇ－ＤａｔａＭｉｎｉｎｇ，Ｉｎｆｅｒｅｎｃｅ，ａｎｄＰｒｅｄｉｃｔｉｏｎ，Ｔ．Ｈａｓｔｉｅ，ｅｔａｌ．，ｅｄｉｔｏｒｓ，ＳｐｒｉｎｇｅｒＳｃｉｅｎｃｅ＋ＢｕｓｉｎｅｓｓＭｅｄｉａ，ＬＬＣ，２ｎｄｅｄｉｔｉｏｎ，２００９も参照）。To generate a classifier using supervised learning techniques, a set of samples, called training data, is obtained. In the context of diagnostic testing, training data includes samples from distinct groups (classes) to which unknown samples will subsequently be assigned. For example, samples collected from individuals in a control population and individuals in a particular disease, condition, or event population (such as "ever" tobacco users (former or current tobacco users)) may constitute training data for developing a classifier that can classify unknown samples (or more specifically, the individuals from whom the samples were obtained) as either "ever" tobacco users or "never tobacco users." Developing a classifier from training data is known as training the classifier. Specific details about classifier training depend on the nature of the supervised learning technique (see, e.g., Pattern Classification, R.O. Duda, et al., editors, John Wiley & Sons, 2nd edition, 2001; see also The Elements of Statistical Learning - Data Mining, Inference, and Prediction, T. Hastie, et al., editors, Springer Science + Business Media, LLC, 2nd edition, 2009).

典型的には、訓練セット中のサンプルよりもより可能性のあるバイオマーカー値が存在するので、過剰適合を回避するように配慮しなければならない。統計モデルが、背景の関連の代わりに、ランダムエラーまたはノイズを表す場合に、過剰適合が起こる。過剰適合は、様々な手法で（例えば分類子の開発において使用されるマーカーの数を限定することによって、マーカー応答が互いに独立していると仮定することによって、用いられる背景の統計モデルの複雑度を限定することによって、及び背景の統計モデルがデータに一致することを確実にすることによって、が挙げられる）、回避され得る。Because there are typically more possible biomarker values than samples in the training set, care must be taken to avoid overfitting. Overfitting occurs when the statistical model represents random error or noise instead of background associations. Overfitting can be avoided in various ways, including, for example, by limiting the number of markers used in developing the classifier, by assuming that marker responses are independent of each other, by limiting the complexity of the background statistical model used, and by ensuring that the background statistical model fits the data.

事象の出現と関連するバイオマーカーのセットを同定するために、対照サンプル及び初期事象サンプルの組み合わせたセットは、主成分分析（ＰＣＡ）を使用して分析された。ＰＣＡは、事例または対照の転帰を考慮せずに、すべてのサンプル間の最も大きな変動によって定義される軸に関してサンプルを提示し、したがって、事例と対照との間の区別を過剰適合するリスクを軽減する。重篤な血栓事象の出現は、強力な偶然の構成要素が関与しており、生命にかかわる血管中の破裂しそうな不安定なプラークの報告が要求されるので、対照サンプルセットと事象サンプルセットとの間で明らかな分離を観察は期待されなかった。事例と対照との間で観察される分離が大きくないが、第２の主成分で、このサンプルのセットにおける全変動のおよそ１０％に対応する当該分離が起こり、このことから、背景の生物学的変動を定量することが比較的単純であることが指摘される。To identify a set of biomarkers associated with event occurrence, the combined set of control and early event samples was analyzed using principal component analysis (PCA). PCA presents samples relative to an axis defined by the greatest variation among all samples, without considering case or control outcomes, thus reducing the risk of overfitting the distinction between cases and controls. Because the occurrence of a serious thrombotic event involves a strong chance component and requires reporting of vulnerable plaque in a life-threatening blood vessel that is prone to rupture, a clear separation between the control and event sample sets was not expected. While the observed separation between cases and controls was not large, it occurred in the second principal component, accounting for approximately 10% of the total variation in this set of samples, indicating a relatively straightforward approach to quantifying background biological variation.

分析の次のセットにおいて、バイオマーカーは、対照サンプルと初期事象サンプルとの間の分離に特異的であったサンプル間の差の成分について分析され得る。用いられ得る１つの方法は、対照セットにおけるサンプル間の変動の上位３つの主成分方向を除去（デフレート）するための、ＤＳＧＡ（Ｂａｉｒ，Ｅ．ａｎｄＴｉｂｓｈｉｒａｎｉ，Ｒ．（２００４）Ｓｅｍｉ－ｓｕｐｅｒｖｉｓｅｄｍｅｔｈｏｄｓｔｏｐｒｅｄｉｃｔｐａｔｉｅｎｔｓｕｒｖｉｖａｌｆｒｏｍｇｅｎｅｅｘｐｒｅｓｓｉｏｎｄａｔａ．ＰＬＯＳＢｉｏｌ．，２，５１１－５２２）の使用である。次元削減は探索するべき対照セットで遂行されるが、対照におけるサンプル及び初期事象からのサンプルの両方は、ＰＣＡを介して実行される。初期事象からの事例の分離は、横軸に沿って観察され得る。In the next set of analyses, biomarkers can be analyzed for components of sample-to-sample difference that were specific to the separation between control and early-event samples. One method that can be used is the use of DSGA (Bair, E. and Tibshirani, R. (2004) Semi-supervised methods to predict patient survival from gene expression data. PLOS Biol., 2, 511-522) to remove (deflate) the top three principal component directions of variation between samples in the control set. While dimensionality reduction is performed on the control set to be explored, both samples in the control and samples from the early event are run through PCA. Separation of cases from the early event can be observed along the horizontal axis.

タバコ使用状態に関連するタンパク質の交差検証された選択
タンパク質予測力を、特定のサンプルの選択の特有のフィーチャへ過剰適合させることを回避するために、交差検証及び次元削除アプローチが、採用され得る。交差検証は、方法がリスクモデルを生ずるのに使用されなかったサンプルへ適用される能力をモニタリングするために、選択されないサンプルの使用と組み合わせて、タンパク質によるリスクの関連を決定するためのサンプルのセットの複数の選択を包含する（ＴｈｅＥｌｅｍｅｎｔｓｏｆＳｔａｔｉｓｔｉｃａｌＬｅａｒｎｉｎｇ－ＤａｔａＭｉｎｉｎｇ，Ｉｎｆｅｒｅｎｃｅ，ａｎｄＰｒｅｄｉｃｔｉｏｎ，Ｔ．Ｈａｓｔｉｅ，ｅｔａｌ．，ｅｄｉｔｏｒｓ，ＳｐｒｉｎｇｅｒＳｃｉｅｎｃｅ＋ＢｕｓｉｎｅｓｓＭｅｄｉａ，ＬＬＣ，２ｎｄｅｄｉｔｉｏｎ，２００９）。本発明者らは、タバコ使用状態のモデル化における高次元データセットへ適用可能なＴｉｂｓｈｉｒａｎｉｅｔａｌの教師ありＰＣＡ方法（Ｂａｉｒ，Ｅ．ａｎｄＴｉｂｓｈｉｒａｎｉ，Ｒ．（２００４）Ｓｅｍｉ－ｓｕｐｅｒｖｉｓｅｄｍｅｔｈｏｄｓｔｏｐｒｅｄｉｃｔｐａｔｉｅｎｔｓｕｒｖｉｖａｌｆｒｏｍｇｅｎｅｅｘｐｒｅｓｓｉｏｎｄａｔａ．ＰＬＯＳＢｉｏｌ．，２，５１１－５２２）を適用した。教師ありＰＣＡ（ＳＰＣＡ）方法は、データにおいて観察された事象危険度と統計的に関連するタンパク質のセットの単変量選択、及びこれらのタンパク質のすべてからの情報を組み合わせる相関する成分の決定を包含する。相関する成分のこの決定は、タンパク質にわたる情報を組み合わせるだけでなく、１０００を超えるタンパク質の完全なタンパク質メニューから少数の主成分まで独立変数の数を低減することによって、過剰適合する可能性も軽減する次元削減ステップである（この作業において、本発明者らは第１の主成分のみを検討した）。 Cross-validated Selection of Proteins Associated with Tobacco Use Status To avoid overfitting protein predictive power to the unique features of a particular sample selection, a cross-validation and dimensionality reduction approach can be employed. Cross-validation involves multiple selection of a set of samples to determine the association of risk with proteins, combined with the use of unselected samples to monitor the ability of the method to apply to samples not used to generate the risk model (The Elements of Statistical Learning - Data Mining, Inference, and Prediction, T. Hastie, et al., editors, Springer Science+Business Media, LLC, 2nd edition, 2009). We applied the supervised PCA method of Tibshirani et al. (Bair, E. and Tibshirani, R. (2004) Semi-supervised methods to predict patient survival from gene expression data. PLOS Biol., 2, 511-522), which is applicable to high-dimensional datasets in modeling tobacco use status. The supervised PCA (SPCA) method involves the univariate selection of a set of proteins statistically associated with the event risk observed in the data, and the determination of correlated components that combine information from all of these proteins. This determination of correlated components is a dimensionality reduction step that not only combines information across proteins, but also reduces the chance of overfitting by reducing the number of independent variables from the full protein menu of over 1000 proteins to a small number of principal components (in this work, we considered only the first principal component).

個別のタンパク質対事象までの時間の関係性の単変量解析及び多変量解析
Ｃｏｘ比例ハザードモデル（Ｃｏｘ，ＤａｖｉｄＲ（１９７２）．“ＲｅｇｒｅｓｓｉｏｎＭｏｄｅｌｓａｎｄＬｉｆｅ－Ｔａｂｌｅｓ”．ＪｏｕｒｎａｌｏｆｔｈｅＲｏｙａｌＳｔａｔｉｓｔｉｃａｌＳｏｃｉｅｔｙ．ＳｅｒｉｅｓＢ（Ｍｅｔｈｏｄｏｌｏｇｉｃａｌ）３４（２）：１８７－２２０））は、医療統計において広く使用される。Ｃｏｘ回帰は、特定の時間関数を累積生存率へ適合させることを回避し、その代りに、ベースラインハザード関数と称される相対リスクのモデルを用いる（これは経時的に変動し得る）。ベースラインハザード関数は、すべての個体についての生存時間分布の一般的な形状を記載し、一方で相対リスクは、１セットの共変量値（単一の個体または群等）についての危険度のレベルを、ベースライン危険度の倍数として与える。相対リスクは、Ｃｏｘモデルにおいて経時的に一定である。 Univariate and Multivariate Analyses of the Relationship Between Individual Proteins and Time to Event The Cox proportional hazards model (Cox, David R (1972). "Regression Models and Life-Tables". Journal of the Royal Statistical Society. Series B (Methodological) 34(2):187-220) is widely used in medical statistics. Cox regression avoids fitting a specific time function to cumulative survival and instead uses a model of relative risk, called the baseline hazard function, which may vary over time. The baseline hazard function describes the general shape of the survival time distribution for all individuals, while the relative risk gives the level of risk for a set of covariate values (such as a single individual or a group) as a multiple of the baseline risk. Relative risk is constant over time in the Cox model.

加速死亡時間（ＡＦＴ）モデルは、生存率モデルのサブクラスである。生存率モデルは、部分的な情報下で事象までの時間のデータを予測する。生存率モデルは打ち切りを考慮するので、これらの打ち切られた対象からのデータを依然として使用し得るが、事象がいつ起こるか予測しようとする他の長期的なモデルは、決定される転帰の対象からの情報のみを使用し得る。そして、生存率モデルが事象までの時間を考慮に入れるので、任意の時間枠内に事象の起こる予測確率を生ずることができ、これは大部分の分類モデル（ロジスティック回帰、ランダムフォレスト）とは異なる。Accelerated mortality time (AFT) models are a subclass of survival models. Survival models predict time-to-event data with partial information. Because survival models account for censoring, they can still use data from these censored subjects, whereas other longitudinal models that attempt to predict when an event will occur may only use information from subjects with a determined outcome. And because survival models account for time-to-event, they can produce predicted probabilities of an event occurring within any time frame, which differs from most classification models (logistic regression, random forests).

ＡＦＴ生存率モデルは、特に、モデルの共変量とｌｏｇ（事象までの時間）との間の線形関係を規定／仮定する回帰モデルである。したがって、ベースラインよりも２倍高い共変量（タンパク質ＲＦＵカウント）の対象は、診断からベースラインよりも２倍長く「生存する」と予測され得る。The AFT survival model is a regression model that, among other things, defines/assumes a linear relationship between the model's covariates and log(time to event). Thus, a subject with a covariate (protein RFU count) twice as high as baseline may be predicted to "survive" twice as long from diagnosis as baseline.

２つの最も一般的な生存率モデルはＡＦＴモデル及び比例ハザードモデルであり、ＡＦＴＷｅｉｂｕｌｌモデルは両方である。比例ハザードモデルの定義はＡＦＴモデルよりも少し複雑性であり、比例ハザードモデルにおいて、ベースラインよりも２倍高い共変量の対象は、任意の時点で２倍高い危険度を有し得、危険度は経時的な生存曲線の負の導関数である。The two most common survival models are the AFT model and the proportional hazards model, with the AFT Weibull model being both. The proportional hazards model is slightly more complex to define than the AFT model; in a proportional hazards model, a subject with a covariate that is twice as high as baseline will have a twice as high risk at any time point, with risk being the negative derivative of the survival curve over time.

他の一般的な比例ハザードモデルは、指数関数及びＣｏｘモデルである。指数関数モデルは、Ｗｅｉｂｕｌｌモデルのサブタイプである。Ｃｏｘモデルは、使用がより限定されており、事象までの時間の予測確率は、Ｃｏｘモデルから利用可能ではなく、相対リスクのみが利用可能である。ＡＦＴモデルは、絶対リスク及び相対リスクの両方を与え得る。Other common proportional hazards models are the exponential and Cox models. The exponential model is a subtype of the Weibull model. The Cox model is more limited in use; predicted probabilities of time to event are not available from the Cox model, only relative risks. The AFT model can provide both absolute and relative risks.

キット
表４のバイオマーカーのうちの任意のものの組み合わせは、本明細書において開示される方法の遂行における使用等に好適なキットを使用して検出され得る。さらに、任意のキットは、本明細書において記載される１つまたは複数の検出可能な標識（蛍光部分など）を含有し得る。 Kits Any combination of the biomarkers in Table 4 can be detected using kits suitable for use in performing the methods disclosed herein. Additionally, any kit can contain one or more detectable labels (such as fluorescent moieties) described herein.

一実施形態において、キットは、（ａ）生物学的サンプル中の１つまたは複数のバイオマーカーの検出のための１つまたは複数の捕捉試薬（例えば少なくとも１つのアプタマーまたは抗体等）であって、バイオマーカーが、表４中で示されるバイオマーカーのうちの任意のものを含む、前記捕捉試薬、及び任意選択で、（ｂ）生物学的サンプルが得られた個体を、本明細書においてさらに記載されるように、「これまでの」タバコ使用者である確率を有するかもしくは有さないかのいずれかとして分類するための１つまたは複数のソフトウェアまたはコンピュータープログラム製品を含む。代替的に、１つまたは複数のコンピュータープログラム製品ではなく、人間によって上記のステップを手動で遂行するための１つまたは複数の指示書が提供され得る。In one embodiment, the kit includes (a) one or more capture reagents (e.g., at least one aptamer or antibody) for the detection of one or more biomarkers in a biological sample, where the biomarkers include any of the biomarkers set forth in Table 4, and optionally (b) one or more software or computer program products for classifying an individual from whom the biological sample was obtained as either having or not having a probability of being a "ever" tobacco user, as further described herein. Alternatively, rather than one or more computer program products, one or more instructions for manually performing the above steps by a human may be provided.

固体支持体とシグナル生成物質を有する対応する捕捉試薬との組み合わせは、本明細書において「検出デバイス」または「キット」と称される。キットは、デバイス及び試薬の使用、サンプルの取り扱い、ならびにデータの分析のための指示書も含み得る。さらに、キットは、生物学的サンプルを分析し、その分析結果を報告するために、コンピューターシステムまたはソフトウェアと共に使用され得る。The combination of a solid support and a corresponding capture reagent with a signal-generating substance is referred to herein as a "detection device" or "kit." The kit may also include instructions for using the device and reagents, handling samples, and analyzing data. Additionally, the kit may be used with a computer system or software to analyze biological samples and report the results of the analysis.

キットは、生物学的サンプルのプロセシングのための１つまたは複数の試薬（例えば可溶化バッファー、デタージェント、洗浄剤、またはバッファー）も含有し得る。本明細書において記載されるキットのうちの任意のものは、例えばバッファー、ブロッキング剤、質量分析マトリックス材、抗体捕捉剤、陽性対照サンプル、陰性対照サンプル、ソフトウェア及び情報（プロトコル、ガイダンス、及び参照データ等）も含み得る。The kits may also contain one or more reagents for processing the biological sample (e.g., solubilization buffer, detergent, cleaning agent, or buffer). Any of the kits described herein may also include, for example, buffers, blocking agents, mass spectrometry matrix materials, antibody capture agents, positive control samples, negative control samples, software, and information (e.g., protocols, guidance, and reference data).

一態様において、本発明は、タバコ使用状態の査定のためのキットを提供する。キットは、表４から選択されるバイオマーカーへ特異的な１つまたは複数のアプタマーのためのＰＣＲプライマーを含む。キットは、使用のための指示書、及びバイオマーカーと「これまでの」タバコ使用者である確率の予測との相関についての指示書をさらに含み得る。キットは、表４から選択されるバイオマーカーに特異的なアプタマーのうちの１つまたは複数の相補物を含有するＤＮＡアレイ、試薬、及び／またはサンプルＤＮＡを増幅もしくは単離するための酵素も含み得る。キットは、リアルタイムＰＣＲのための試薬（例えばＴａｑＭａｎプローブ及び／またはプライマー、ならびに酵素）を含み得る。In one aspect, the present invention provides a kit for assessing tobacco use status. The kit includes PCR primers for one or more aptamers specific to a biomarker selected from Table 4. The kit may further include instructions for use and instructions for correlating the biomarker with a prediction of the probability of being a "ever" tobacco user. The kit may also include a DNA array containing complements of one or more aptamers specific to a biomarker selected from Table 4, reagents, and/or enzymes for amplifying or isolating sample DNA. The kit may include reagents for real-time PCR (e.g., TaqMan probes and/or primers, and enzymes).

例えば、キットは、（ａ）検査サンプル中の１つまたは複数のバイオマーカーを定量するための少なくとも捕捉試薬を含む試薬であって、前記バイオマーカーが、表４中で示されるバイオマーカー、または本明細書において記載される他のバイオマーカーもしくはバイオマーカーパネルを含む、前記試薬、ならびに任意選択で、（ｂ）検査サンプル中の定量された各々のバイオマーカーの量を１つまたは複数の既定のカットオフに比較し、前記比較に基づいて定量された各々のバイオマーカーについてのスコアを割り当てるステップ、定量された各々のバイオマーカーについての割り当てられたスコアを合わせて合計のスコアを得るステップ、合計のスコアを既定のスコアと比較するステップ、及び前記比較を使用して、個体が「これまでに」タバコ使用者であった確率を決定するステップを遂行するための１つまたは複数のアルゴリズムまたはコンピュータープログラムを含み得る。代替的に、１つまたは複数のアルゴリズムまたはコンピュータープログラムではなく、人間によって上記のステップを手動で遂行するための１つまたは複数の指示書が提供され得る。For example, the kit may include: (a) a reagent including at least a capture reagent for quantifying one or more biomarkers in a test sample, wherein the biomarkers include the biomarkers shown in Table 4 or other biomarkers or biomarker panels described herein; and, optionally, (b) one or more algorithms or computer programs for performing the steps of: comparing the amount of each quantified biomarker in the test sample to one or more predetermined cutoffs and assigning a score for each quantified biomarker based on the comparison; combining the assigned scores for each quantified biomarker to obtain a total score; comparing the total score to a predetermined score; and using the comparison to determine the probability that the individual was an "ever" tobacco user. Alternatively, one or more instructions for manually performing the above steps by a human may be provided rather than one or more algorithms or computer programs.

コンピューターの方法及びソフトウェア
一旦バイオマーカーまたはバイオマーカーパネルが選択されたならば、個体を診断する方法は、以下の：１）生物学的サンプルを収集するかまたは他の方法で得ること；２）分析方法を遂行して、生物学的サンプル中のバイオマーカーまたはパネルのバイオマーカーを検出及び測定すること；３）バイオマーカーレベルを収集するために使用される方法に要求される任意のデータの正規化または標準化を遂行すること；４）マーカースコアを計算すること；５）マーカースコアを合わせて合計の診断スコアまたは予測スコアを得ること；ならびに６）個体の診断スコアまたは予測スコアを報告することを含み得る。このアプローチにおいて、診断スコアまたは予測スコアは、すべてのマーカー計算の合計から決定される単一の数値であり得、これは、疾患の存在もしくは非存在の指標、または「これまでの」タバコ使用者もしくは「１度もタバコを使用したことがない」者の指標である、前設定された閾値に比較される。または、診断スコアもしくは予測スコアは、各々がバイオマーカーレベルを表わすバーのシリーズであり得、応答のパターンは、疾患、病態、もしくは事象の増加した（または増加していない）リスクの存在もしくは非存在の決定のために前設定されたパターンに比較され得る。 Computer Methods and Software Once a biomarker or panel of biomarkers has been selected, a method for diagnosing an individual may include: 1) collecting or otherwise obtaining a biological sample; 2) performing an analytical method to detect and measure the biomarker or panel of biomarkers in the biological sample; 3) performing any data normalization or standardization required by the method used to collect biomarker levels; 4) calculating marker scores; 5) combining the marker scores to obtain a total diagnostic or predictive score; and 6) reporting the individual's diagnostic or predictive score. In this approach, the diagnostic or predictive score may be a single number determined from the sum of all marker calculations, which is compared to a preset threshold that is indicative of the presence or absence of disease, or of being an "ever" tobacco user or a "never" tobacco user. Alternatively, the diagnostic or predictive score may be a series of bars, each representing a biomarker level, and the pattern of response may be compared to a preset pattern to determine the presence or absence of increased (or non-increased) risk of a disease, condition, or event.

本明細書において記載される方法の少なくともいくつかの実施形態は、コンピューターの使用により実装され得る。コンピューターシステム１００の例が、図５中で示される。図５を参照して、システム１００は、プロセッサー１０１、入力デバイス１０２、出力デバイス１０３、ストレージデバイス１０４、コンピューター可読ストレージメディアリーダー１０５ａ、通信システム１０６、プロセシング加速（例えばＤＳＰまたは専用プロセッサー）１０７、及びメモリー１０９を含む、バス１０８を介して電気的にカップリングされたハードウェア要素から構成されることが示される。コンピューター可読ストレージメディアリーダー１０５ａは、コンピューター可読ストレージメディア１０５ｂへさらにカップリングされ、この組み合わせは、コンピューター可読情報を一時的に及び／またはより恒久的に含有するための、遠隔の、ローカルな、固定の、及び／またはリムーバブルなストレージデバイスに、ストレージメディア、メモリーなどを加えものを包括的に表わし、これは、ストレージデバイス１０４、メモリー１０９、及び／または任意の他のかかるアクセス可能なシステム１００リソースを含み得る。システム１００は、オペレーティングシステム１９２及び他のコード１９３（プログラム、データ、及び同種のもの等）を含む、ソフトウェア要素（一般に作業メモリー１９１内に位置するとして示される）も含む。At least some embodiments of the methods described herein may be implemented using a computer. An example computer system 100 is shown in FIG. 5. Referring to FIG. 5, system 100 is shown to be comprised of hardware elements electrically coupled via bus 108, including a processor 101, input devices 102, output devices 103, storage devices 104, computer-readable storage media reader 105a, communications system 106, processing acceleration (e.g., a DSP or dedicated processor) 107, and memory 109. Computer-readable storage media reader 105a is further coupled to computer-readable storage media 105b, which combination collectively represents remote, local, fixed, and/or removable storage devices, plus storage media, memory, etc., for temporarily and/or more permanently containing computer-readable information, which may include storage device 104, memory 109, and/or any other such accessible system 100 resource. System 100 also includes software elements (generally shown as residing in working memory 191), including an operating system 192 and other code 193 (such as programs, data, and the like).

図５に関して、システム１００は、幅広い柔軟性及び構成可能性を有する。したがって例えば、単一のアーキテクチャは１つまたは複数のサーバを実装するために利用され得、当該サーバは、一般に所望されるプロトコル、プロトコルのバリエーション、拡張などに従ってさらに構成され得る。しかしながら、実施形態がより具体的なアプリケーション要求に従って良好に利用され得ることは、当業者に明らかであろう。例えば、１つまたは複数のシステム要素は、システム１００構成要素内の（例えば通信システム１０６内の）下位要素として実装され得る。カスタマイズされたハードウェアも利用され得、及び／または特定の要素は、ハードウェア、ソフトウェア、もしくは両方において実装され得る。さらに、ネットワーク入力／出力デバイス（図示せず）等の他のコンピューティングデバイスへの接続が用いられ得るが、他のコンピューティングデバイスへの有線、無線、モデム、及び／または、他の接続（複数可）も利用され得ることを理解されたい。With reference to FIG. 5, system 100 has a wide range of flexibility and configurability. Thus, for example, a single architecture may be utilized to implement one or more servers, which may be further configured according to commonly desired protocols, protocol variations, extensions, and the like. However, it will be apparent to those skilled in the art that embodiments may be better utilized according to more specific application requirements. For example, one or more system elements may be implemented as sub-elements within a system 100 component (e.g., within communications system 106). Customized hardware may also be utilized, and/or particular elements may be implemented in hardware, software, or both. Additionally, connections to other computing devices, such as network input/output devices (not shown), may be utilized, although it should be understood that wired, wireless, modem, and/or other connection(s) to other computing devices may also be utilized.

一態様において、システムは、個体が「これまでに」タバコ使用者である確率の予測のバイオマーカー特徴のフィーチャを含有するデータベースを含み得る。バイオマーカーデータ（またはバイオマーカー情報）は、コンピューター実装方法の一部としての使用のために、コンピューターへの入力として利用され得る。バイオマーカーデータは、本明細書において記載されるデータを含み得る。In one embodiment, the system may include a database containing features of biomarker characteristics predictive of the probability that an individual is an "ever" tobacco user. The biomarker data (or biomarker information) may be utilized as input to a computer for use as part of a computer-implemented method. The biomarker data may include the data described herein.

一態様において、システムは、１つまたは複数のプロセッサーへの入力データを提供するための１つまたは複数のデバイスをさらに含む。In one aspect, the system further includes one or more devices for providing input data to the one or more processors.

システムは、ランク付けられたデータ要素のデータセットの保存ためのメモリーをさらに含む。The system further includes a memory for storing the dataset of ranked data elements.

別の態様において、入力データを提供するためのデバイスは、データ要素の特徴を検出するための検出器（例えば質量分析計または遺伝子チップリーダー等）を含む。In another embodiment, the device for providing input data includes a detector (e.g., a mass spectrometer or gene chip reader) for detecting characteristics of the data elements.

システムは、追加でデータベース管理システムを含み得る。ユーザーのリクエストまたはクエリは、クエリをプロセシングして訓練セットのデータベースからの関係情報を抽出するデータベース管理システムによって理解される適切な言語でフォーマットされ得る。The system may additionally include a database management system. User requests or queries may be formatted in an appropriate language understood by the database management system, which processes the queries and extracts relevant information from the training set database.

システムは、ネットワークサーバ及び１つまたは複数のクライアントが接続されるネットワークへ接続可能であり得る。ネットワークは、当技術分野において公知のように、ローカルエリアネットワーク（ＬＡＮ）またはワイドエリアネットワーク（ＷＡＮ）であり得る。好ましくは、サーバは、コンピュータープログラム製品（例えばソフトウェア）を実行して、ユーザーリクエストのプロセシングのためにデータベースデータにアクセスするのに必要なハードウェアを含む。The system may be connectable to a network to which a network server and one or more clients are connected. The network may be a local area network (LAN) or a wide area network (WAN), as known in the art. Preferably, the server includes the hardware necessary to execute computer program products (e.g., software) and access database data for processing user requests.

システムは、データベース管理システムからの指示の実行のためのオペレーティングシステム（例えばＵＮＩＸまたはＬｉｎｕｘ）を含み得る。一態様において、オペレーティングシステムはグローバル通信ネットワーク（インターネット等）上で作動し、かかるネットワークへ接続するためにグローバル通信ネットワークサーバを利用し得る。The system may include an operating system (e.g., UNIX or Linux) for executing instructions from the database management system. In one aspect, the operating system may operate over a global communications network (e.g., the Internet) and utilize a global communications network server for connecting to such a network.

システムは、当技術分野において公知のグラフィカルユーザーインターフェースにおいて慣例的に見出されるような、インターフェース要素（ボタン、プルダウンメニュー、スクロールバー、テキスト登録用フィールド、及び同種のもの等）を含むグラフィカルディスプレイインターフェースを含む、１つまたは複数のデバイスを含み得る。ユーザーインタフェース上で登録されたリクエストを、フォーマットのためにシステム中のアプリケーションプログラムへ送信して、システムデータベースのうちの１つまたは複数における関係情報について検索することができる。ユーザーによって登録されたリクエストまたはクエリは、任意の好適なデータベース言語で構築され得る。The system may include one or more devices that include a graphical display interface that includes interface elements (such as buttons, pull-down menus, scroll bars, text entry fields, and the like) conventionally found in graphical user interfaces known in the art. Requests submitted on the user interface may be sent to application programs in the system for formatting and searching for relevant information in one or more of the system databases. Requests or queries submitted by users may be constructed in any suitable database language.

グラフィカルユーザーインターフェースは、オペレーティングシステムの一部としてグラフィカルユーザーインターフェースコードによって生成され得、データを入力するため、及び／または入力されたデータをディスプレイするために使用され得る。プロセシングされたデータの結果は、インターフェースでディスプレイされ得る、システムと通信中のプリンターでプリントされ得る、メモリーデバイス中で保存され得る、及び／もしくはネットワークにわたって送信され得るか、またはコンピューター可読メディアの形態で提供され得る。A graphical user interface may be generated by graphical user interface code as part of the operating system and may be used to input data and/or display input data. The results of processed data may be displayed in the interface, printed on a printer in communication with the system, stored in a memory device, and/or transmitted over a network, or provided in the form of a computer-readable medium.

システムは、データ要素に関するデータ（例えば発現値）をシステムへ提供するために、入力デバイスと通信中であり得る。一態様において、入力デバイスは、例えば質量分析計、遺伝子チップ、またはアレイリーダー、及び同種のものを含む、遺伝子発現プロファイリングシステムを含み得る。The system may be in communication with an input device to provide data (e.g., expression values) regarding the data elements to the system. In one aspect, the input device may include a gene expression profiling system, including, for example, a mass spectrometer, gene chip, or array reader, and the like.

様々な実施形態に従って、タバコ使用状態バイオマーカー情報を分析する方法及び装置は、任意の好適な様式で、例えばコンピューターシステム上で作動するコンピュータープログラムを使用して実装され得る。プロセッサー及びランダムアクセスメモリ（リモートアクセス可能なアプリケーションサーバ、ネットワークサーバ、パーソナルコンピューター、またはワークステーション等）を含む従来のコンピューターシステムが使用され得る。追加のコンピューターシステム構成要素としては、メモリーデバイスまたは情報ストレージシステム（マスストレージシステム等）及びユーザーインタフェース（例えば従来のモニター、キーボード、及びトラッキングデバイス）が挙げられ得る。コンピューターシステムは、独立型システム、またはサーバ及び１つまたは複数のデータベースを含むコンピューターのネットワークの一部であり得る。According to various embodiments, the methods and apparatus for analyzing tobacco use status biomarker information may be implemented in any suitable manner, for example, using a computer program running on a computer system. A conventional computer system including a processor and random access memory (such as a remotely accessible application server, network server, personal computer, or workstation) may be used. Additional computer system components may include a memory device or information storage system (such as a mass storage system) and a user interface (e.g., a conventional monitor, keyboard, and tracking device). The computer system may be a stand-alone system or part of a network of computers including a server and one or more databases.

タバコ使用査定のバイオマーカー分析システムは、データ分析（データ収集、プロセシング、分析、報告、及び／または診断等）を完了するための機能及びオペレーションを提供し得る。例えば一実施形態において、コンピューターシステムは、タバコ使用査定の予測バイオマーカーに関連する情報を、受信、保存、検索、分析し、報告し得るコンピュータープログラムを実行し得る。コンピュータープログラムは、様々な機能またはオペレーションを遂行する複数のモジュール（生データのプロセシング及び補足データの生成のためのプロセシングモジュールならびに生データ及び補足データを分析してタバコ使用予測状態を生成するための分析モジュール等）を含み得る。個体が「これまでに」タバコ使用者である確率の決定は、疾患、病態、もしくは事象に関連する個体の状態に関する他の情報（追加の生物医学的情報を包含する）を生成もしく収集すること、さらなる検査が所望され得るかどうかを同定すること、または個体の健康状態を他の方法で評価することを任意選択で含み得る。A tobacco use assessment biomarker analysis system may provide functions and operations for completing data analysis (e.g., data collection, processing, analysis, reporting, and/or diagnosis). For example, in one embodiment, a computer system may execute a computer program that may receive, store, retrieve, analyze, and report information related to predictive biomarkers for tobacco use assessment. The computer program may include multiple modules that perform various functions or operations (e.g., a processing module for processing raw data and generating supplemental data and an analysis module for analyzing the raw data and supplemental data to generate a predicted tobacco use status). Determining the probability that an individual is a "ever" tobacco user may optionally include generating or collecting other information (including additional biomedical information) regarding the individual's status related to a disease, condition, or event, identifying whether further testing may be desired, or otherwise assessing the individual's health status.

ここで図６を参照して、開示される実施形態の原理に従ってコンピューターを利用する方法の例が見られる。図６中で、フローチャート３０００が示される。ブロック３００４において、バイオマーカー情報は、個体についてリトリーブされ得る。バイオマーカー情報は、例えば個体の生物学的サンプルの検査が遂行された後に、コンピューターデータベースからリトリーブされ得る。バイオマーカー情報は、表４のバイオマーカーのうちの１つまたは複数に各々が対応するバイオマーカーレベルを含み得る。ブロック３００８において、コンピューターは、バイオマーカーレベルの各々を分類するために利用され得る。そして、ブロック３０１２において、決定は、個体が「これまでに」タバコ使用者である確率について、複数の分類で行われ得る。表示は、人によって視認可能なように、ディスプレイまたは他の表示デバイスに出力され得る。したがって例えば、それは、コンピューターまたは他の出力デバイスのディスプレイスクリーンにディスプレイされ得る。Referring now to FIG. 6, an example of a computer-based method according to the principles of the disclosed embodiments can be seen. In FIG. 6, a flowchart 3000 is shown. At block 3004, biomarker information can be retrieved for an individual. The biomarker information can be retrieved from a computer database, for example, after testing of the individual's biological sample has been performed. The biomarker information can include biomarker levels, each corresponding to one or more of the biomarkers in Table 4. At block 3008, a computer can be utilized to classify each of the biomarker levels. Then, at block 3012, a determination can be made in multiple classifications as to the probability that the individual is an "ever" tobacco user. The display can be output to a display or other display device so as to be viewable by a human. Thus, for example, it can be displayed on a display screen of a computer or other output device.

本明細書において記載されるいくつかの実施形態は、コンピュータープログラム製品を含むように実装され得る。コンピュータープログラム製品は、データベースを備えたコンピューターでアプリケーションプログラムを実行させるために、メディアで具現化されるコンピューター可読プログラムコードを有するコンピューター可読メディアを含み得る。Some embodiments described herein may be implemented to include a computer program product. The computer program product may include a computer-readable medium having computer-readable program code embodied in the medium for causing an application program to be executed on a computer with a database.

本明細書において使用される時、「コンピュータープログラム製品」は、任意の性質（例えば、書面による、電子的、磁気的、光学的、または他の方法）の物理的メディア上で含有され、コンピューターまたは他の自動データプロセシングシステムにより使用され得る、自然言語文またはプログラミング言語文の形態で組織化された指示のセットを指す。かかるプログラミング言語文は、コンピューターまたはデータ処理システムによって実行される場合に、命令文の特定の内容に従って、コンピューターまたはデータ処理システムを動作させる。コンピュータープログラム製品としては、ソースコード及びオブジェクトコードでのプログラムならびに／またはコンピューター可読メディアに組み込まれたテストライブラリーもしくはデータライブラリーが限定されずに挙げられる。さらに、コンピューターシステムまたはデータプロセシング機器デバイスを事前選択された方式で動作させることを可能にするコンピュータープログラム製品は、多数の形態で提供され得、当該形態としては、オリジナルソースコード、アセンブリコード、オブジェクトコード、機械言語、上記のものの暗号化バージョンまたは圧縮バージョン、ならびに任意の及びすべて均等物が挙げられるがこれらに限定されない。As used herein, a "computer program product" refers to a set of instructions, organized in the form of natural language statements or programming language statements, contained on physical media of any nature (e.g., written, electronic, magnetic, optical, or other) and usable by a computer or other automated data processing system. Such programming language statements, when executed by a computer or data processing system, cause the computer or data processing system to operate according to the specific content of the instructions. Computer program products include, but are not limited to, programs in source code and object code and/or test or data libraries embodied in computer-readable media. Furthermore, computer program products that enable a computer system or data processing device to operate in a preselected manner may be provided in numerous forms, including, but not limited to, original source code, assembly code, object code, machine language, encrypted or compressed versions of the above, and any and all equivalents.

一態様において、コンピュータープログラム製品は、タバコ使用状態の査定のために提供される。コンピュータープログラム製品は、コンピューティングデバイスまたはシステムのプロセッサーによって実行可能なプログラムコードを具現化するコンピューター可読メディアを含み、プログラムコードは、個体からの生物学的サンプルに帰属するデータをリトリーブするコードであって、データが、表４のバイオマーカーのうちの１つまたは複数に各々が対応するバイオマーカー値を含む、前記コード；及びバイオマーカー値の関数として個体の「これまでの」タバコ使用者状態の確率を示す分類方法を実行するコードを含む。In one aspect, a computer program product is provided for assessing tobacco use status. The computer program product includes a computer-readable medium embodying program code executable by a processor of a computing device or system, the program code including: code for retrieving data belonging to a biological sample from an individual, the data including biomarker values each corresponding to one or more of the biomarkers in Table 4; and code for implementing a classification method that indicates the probability of "ever" tobacco user status for the individual as a function of the biomarker values.

様々な実施形態が方法または装置として記載されてきたが、実施形態は、コンピューターとカップルされたコード（例えばコンピューター上に常駐するコードまたはコンピューターによってアクセス可能であるコード）を介して実装され得ることを理解されたい。例えば、ソフトウェア及びデータベースは、上で考察された多数の方法を実装するために利用され得る。したがって、ハードウェアによって達成される実施形態に加えて、これらの実施形態は、本明細書において開示される機能の実行可能性を引き起こす、その中に具現化されたコンピューター可読プログラムコードを有するコンピューター使用可能なメディアから構成される製造品の使用を介して達成され得ることにも留意されたい。したがって、実施形態は、それらのプログラムコード手段においても本特許によって同様に保護されたと判断されることが所望される。さらに、実施形態は、事実上任意の種類のコンピューター可読メモリーにおいても保存されたコードとして具現化され得、当該メモリーとしては、ＲＡＭ、ＲＯＭ、磁気メディア、光学的メディア、または磁気－光学メディアが限定されずに挙げられる。さらにより一般的には、実施形態は、ソフトウェアもしくはハードウェア、または任意のそれらの組み合わせにおいて実装され得、これらのものとしては、汎用プロセッサーで作動するソフトウェア、マイクロコード、ＰＬＡ、またはＡＳＩＣが挙げられるがこれらに限定されない。While various embodiments have been described as methods or apparatus, it should be understood that embodiments may be implemented via code coupled to a computer (e.g., code resident on or accessible by a computer). For example, software and databases may be utilized to implement many of the methods discussed above. Accordingly, in addition to embodiments achieved via hardware, it should also be noted that these embodiments may be achieved via the use of an article of manufacture comprising a computer-usable medium having computer-readable program code embodied therein that causes the performance of the functions disclosed herein. Accordingly, it is desired that embodiments be deemed protected by this patent in their program code form as well. Furthermore, embodiments may be embodied as code stored in virtually any type of computer-readable memory, including, but not limited to, RAM, ROM, magnetic media, optical media, or magneto-optical media. Even more generally, embodiments may be implemented in software or hardware, or any combination thereof, including, but not limited to, software running on a general-purpose processor, microcode, a PLA, or an ASIC.

実施形態が、搬送波において具現化されたコンピューターシグナルとして、そして伝送メディアを介して伝播されるシグナル（例えば電気的で及び光学的）として達成され得ることも想定される。したがって、上で考察された様々なタイプの情報は、構造（データ構造等）でフォーマットされ得、伝送メディアを介して電気シグナルとして送信され得るか、またはコンピューター可読メディア上に保存され得る。It is also contemplated that embodiments may be achieved as a computer signal embodied in a carrier wave and as a signal propagated over a transmission medium (e.g., electrical and optical). Thus, various types of information discussed above may be formatted in a structure (e.g., a data structure) and transmitted as an electrical signal over a transmission medium or stored on a computer-readable medium.

本明細書において列挙される多数の構造、材料、及び行為が、機能の遂行のための手段または機能の遂行のためのステップとして列挙され得ることにも留意されたい。したがって、かかる文言は、参照によって援用される事項を含め、本明細内に開示されるすべてのかかる構造、材料、または作用、及びそれらの均等物をカバーする権利を有することを理解されたい。It should also be noted that many of the structures, materials, and acts recited herein may be recited as a means for performing a function or a step for performing a function. Accordingly, such language should be understood to be entitled to cover all such structures, materials, or acts disclosed herein, including those incorporated by reference, and their equivalents.

本明細書において開示されるバイオマーカー同定プロセス、バイオマーカーの利用、及びバイオマーカー値を決定するための様々な方法は、個体が「これまでに」タバコ使用者である確率の評価に関して詳細に上で記載される。しかしながら、プロセスの適用、同定されたバイオマーカーの使用、及びバイオマーカー値を決定する方法は、他の特定のタイプの疾患もしくは医学的病態、または補助的な医学的治療によって利益を得るかもしくは得ない個体の同定に十分に適用可能である。The biomarker identification process, uses of the biomarkers, and various methods for determining biomarker values disclosed herein are described in detail above with respect to assessing the probability that an individual is a "ever" tobacco user. However, the application of the process, uses of the identified biomarkers, and methods for determining biomarker values are fully applicable to identifying other specific types of diseases or medical conditions, or individuals who would or would not benefit from adjunctive medical treatment.

他の方法
いくつかの実施形態において、本明細書において記載されるバイオマーカー及び方法は、医療保険料もしくは補償範囲の判定及び／または生命保険料もしくは補償範囲の判定を決定するために使用される。いくつかの実施形態において、本明細書において記載される方法の結果は、医療保険料及び／または生命保険料を決定するために使用される。いくつかのかかる実例において、医療保険または生命保険を提供する組織は、対象のタバコ使用状態に関する情報をリクエストするかまたは他の方法で得て、その情報を使用して対象について適切な医療保険料または生命保険料を決定する。いくつかの実施形態において、医療保険または生命保険を提供する組織によって検査がリクエストされ、当該組織によって支払われる。いくつかの実施形態において、この検査は、買収が進んだ場合の将来の負債または費用を予測するために、実務または保健制度または会社を買収する可能性のある者によって使用される。 Other Methods In some embodiments, the biomarkers and methods described herein are used to determine health insurance premiums or coverage determinations and/or life insurance premiums or coverage determinations. In some embodiments, the results of the methods described herein are used to determine health insurance premiums and/or life insurance premiums. In some such instances, an organization providing health insurance or life insurance requests or otherwise obtains information regarding a subject's tobacco use status and uses that information to determine appropriate health insurance or life insurance premiums for the subject. In some embodiments, the test is requested by and paid for by an organization providing health insurance or life insurance. In some embodiments, the test is used by a potential acquirer of a practice or health system or company to predict future liabilities or costs if the acquisition proceeds.

いくつかの実施形態において、本明細書において記載されるバイオマーカー及び方法は、医療資源の利用を予測及び／または管理するために使用される。いくつかのかかる実施形態において、方法はかかる予測の目的のために実行されないが、方法から得られる情報は、医療資源の利用のかかる予測及び／または管理において使用される。例えば、検査施設または病院は、特定の設備または特定の地理的領域における医療資源の利用を予測及び／または管理するために、本方法からの多数の対象についての情報を集め得る。In some embodiments, the biomarkers and methods described herein are used to predict and/or manage healthcare resource utilization. In some such embodiments, the method is not performed for the purpose of such prediction, but information obtained from the method is used in such prediction and/or management of healthcare resource utilization. For example, a laboratory or hospital may collect information about a large number of subjects from the method to predict and/or manage healthcare resource utilization in a particular facility or in a particular geographic area.

以下の実施例は、例示の目的のためのみに提供され、添付の特許請求の範囲によって定義される本出願の範囲を限定することは意図されない。本明細書において記載されるすべての実施例は、当業者に周知でルーチンの標準的な技法を使用して実行された。以下の実施例において記載されるルーチンの分子生物学技法は、標準的な実験手引き書（Ｓａｍｂｒｏｏｋｅｔａｌ．，ＭｏｌｅｃｕｌａｒＣｌｏｎｉｎｇ：ＡＬａｂｏｒａｔｏｒｙＭａｎｕａｌ，３ｒｄ．ｅｄ．，ＣｏｌｄＳｐｒｉｎｇＨａｒｂｏｒＬａｂｏｒａｔｏｒｙＰｒｅｓｓ，ＣｏｌｄＳｐｒｉｎｇＨａｒｂｏｒ，Ｎ．Ｙ．，（２００１）等）において記載されるように実行され得る。The following examples are provided for illustrative purposes only and are not intended to limit the scope of the present application, which is defined by the appended claims. All examples described herein were carried out using standard techniques well known and routine to those skilled in the art. The routine molecular biology techniques described in the following examples can be carried out as described in standard laboratory manuals (e.g., Sambrook et al., Molecular Cloning: A Laboratory Manual, 3rd ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., (2001)).

実施例１：バイオマーカー同定のためのマルチプレックスアプタマーアッセイ及び統計的アプローチ
検査サンプル及び対照サンプルを分析して、個体が「これまでに」タバコ使用者である確率の予測バイオマーカーを同定するために、マルチプレックスアプタマーアッセイを使用した。本実験において使用されるマルチプレックス分析は、小さなサンプル体積（約６５μｌの血清または血漿）から、血液中のおよそ５，０００タンパク質を、低い検出限界（１ｐＭ中央値）、約７ｌｏｇのダイナミックレンジ、及び約５％の変動係数の中央値で検出するためのアプタマーを含んでいた。マルチプレックスアプタマーアッセイは、概して例えばＧｏｌｄｅｔａｌ．（２０１０）Ａｐｔａｍｅｒ－ＢａｓｅｄＭｕｌｔｉｐｌｅｘｅｄＰｒｏｔｅｏｍｉｃＴｅｃｈｎｏｌｏｇｙｆｏｒＢｉｏｍａｒｋｅｒＤｉｓｃｏｖｅｒｙ．ＰＬｏＳＯＮＥ５（１２）：ｅ１５００４；ならびに米国特許出願公開第２０１２／０１０１００２号及び同第２０１２／００７７６９５号中で記載される。 Example 1: Multiplexed Aptamer Assay and Statistical Approach for Biomarker Identification A multiplexed aptamer assay was used to analyze test and control samples to identify predictive biomarkers for the probability that an individual is an "ever" tobacco user. The multiplexed assay used in this experiment included aptamers for detecting approximately 5,000 proteins in blood from a small sample volume (approximately 65 μl of serum or plasma) with a low detection limit (1 pM median), a dynamic range of approximately 7 logs, and a median coefficient of variation of approximately 5%. Multiplexed aptamer assays are generally described, for example, in Gold et al. (2010) Aptamer-Based Multiplexed Proteomic Technology for Biomarker Discovery. PLoS ONE 5(12):e15004; and U.S. Patent Application Publication Nos. 2012/0101002 and 2012/0077695.

実施例２：モデルの仕様
主要評価項目はタバコ使用状態であり、自己申告したタバコ使用習慣のカテゴリの評価項目であり、それは、３レベル：「１度もタバコを使用したことがない」者、「過去の」タバコ使用者、または「現在の」タバコ使用者であった。「過去の」（例えば「以前の」）タバコ使用者を、「現在の」タバコ使用者と合わせて、「これまでの」とラベル付けし、モデル開発のために「１度もタバコを使用したことがない」群に比較した。 Example 2: Model Specification The primary outcome was tobacco use status, a categorical measure of self-reported tobacco use habits, which had three levels: "never tobacco user,""past" tobacco user, or "current" tobacco user. "Past" (e.g., "former") tobacco users were combined with "current" tobacco users, labeled "ever," and compared to the "never tobacco user" group for model development.

選択したモデルは、３９フィーチャ、タンパク質のみ（表４を参照）のエラスティックネットロジスティック回帰モデルである。このモデルを、「これまでの」（「現在の」及び「以前の」）タバコ使用者または「１度もタバコを使用したことがない」者の二項分類評価項目で訓練した。The model selected was a 39-feature, protein-only (see Table 4) elastic net logistic regression model. The model was trained on a binary classification endpoint of "ever" ("current" and "former") tobacco users or "never" tobacco users.

モデルは、単一の予測出力、予測確率を提供する。この出力は、採血時で「これまでの」タバコ使用者である絶対確率（すなわち絶対リスク）であり、値は０～１である。絶対確率に基づいて３位に丸めて、予測値を、２つのクラス：「１度もタバコを使用したことがない」者及び「これまでの」タバコ使用者へと分類する。以下の表３は、絶対確率範囲及びそれに関連するカテゴリラベルを示す。最終的なモデルについての受信者動作特性（ＲＯＣ）曲線を図１中で示し、「１度もタバコを使用ことがない」者及び「これまでの」タバコ使用者の分類についての予測確率のボックスプロットを図２中で示す。The model provides a single prediction output, predicted probability. This output is the absolute probability (i.e., absolute risk) of being an "ever" tobacco user at the time of blood draw, and ranges in value from 0 to 1. Based on the absolute probability and rounded to thirds, the predictions are categorized into two classes: "never" tobacco users and "ever" tobacco users. Table 3 below shows the absolute probability ranges and their associated category labels. The receiver operating characteristic (ROC) curve for the final model is shown in Figure 1, and boxplots of the predicted probabilities for the "never" and "ever" tobacco user classifications are shown in Figure 2.

表５は、モデルのための性能メトリクスを示す。
Table 5 shows the performance metrics for the model.

モデル出力は、採血時での「これまでの」タバコ使用者または「１度もタバコを使用したことがない」者の二項分類である。０．４６０未満の予測確率を、「１度もタバコを使用したことがない」者、及びさもなければ「これまでの」タバコ使用者としてラベル付けした。ＲＵＯ目的のために、これまでにタバコ使用者であった確率は、予測確率スコアと共にカテゴリクラス（「これまでにあった」ｖｓ「１度もない」）として報告されるだろう。このモデルの出力が確率に基づくので、［０、１］の範囲外の値は不全であり、報告されないだろう。
The model output is a binary classification of "ever" tobacco user or "never" tobacco user at the time of blood draw. Predicted probabilities less than 0.460 were labeled as "never" tobacco users, and otherwise as "ever" tobacco users. For RUO purposes, the probability of being an ever tobacco user will be reported as a categorical class ("ever" vs. "never") along with the predicted probability score. Because the output of this model is based on probability, values outside the range [0, 1] are insufficient and will not be reported.

実施例３：検査開発及び妥当性確認のためのデータセット
開発及び妥当性確認コホート（複数可）。Ｃｏｖａｎｃｅ試験は、バイオマーカー試験のための健康な個体からの血液サンプル及びベースライン情報の収集とも呼ばれ、米国の６つの地域：Ａｕｓｔｉｎ、ＴＸ；Ｂｏｉｓｅ、ＩＤ；Ｄａｌｌａｓ、ＴＸ；Ｈｏｎｏｌｕｌｕ、ＨＩ；Ｐｏｒｔｌａｎｄ、ＯＲ；ＳａｎＤｉｅｇｏ、ＣＡにわたって遂行された観察試験である。２００８年～２００９年に、Ｃｏｖａｎｃｅの試験で、米国の自己申告病歴及び人口層の民族／人種及び性別の層と一致する１９～９０歳の１０２９名の個体を登録し、そして血液サンプルを単一の時点で採取した。本検査開発のために、１９～８９歳の１，０２０名の個体について利用可能な血漿サンプルがあった。 Example 3: Dataset for Test Development and Validation Development and Validation Cohort(s). The Covance study, also referred to as the collection of blood samples and baseline information from healthy individuals for biomarker testing, is an observational study conducted across six regions of the United States: Austin, TX; Boise, ID; Dallas, TX; Honolulu, HI; Portland, OR; and San Diego, CA. In 2008-2009, the Covance study enrolled 1,029 individuals aged 19-90 years who matched the ethnic/racial and gender demographics of the U.S. population with self-reported medical history, and blood samples were collected at a single time point. For this test development, there were plasma samples available for 1,020 individuals aged 19-89 years.

本試験についての除外基準については、
１．コントロール不良な高血圧（すなわち、１０分間隔の２回の測定値＞１６０／９５）；
２．過去２年間の皮膚の扁平上皮細胞癌または基底細胞癌以外の悪性腫瘍についての自己申告した治療；
３．自己申告した妊娠；
４．自己申告した慣性的な感染性病態（複数可）（例えばＢ型肝炎、Ｃ型肝炎、ＨＩＶ）、自己免疫性病態（複数可）、または他の炎症性病態（複数可）（ＳＬＥ、強皮症、ＭＳ、クローン病、または潰瘍性大腸炎等）；
５．自己申告した慣性的な腎臓疾患または肝臓疾患、
６．自己申告した慢性心不全、または過去３か月において心筋梗塞と診断された；
７．自己申告したコントロール不良な糖尿病（もし分かるならばＨｂＡ１ｃ＞８）；
８．自己申告した急性のウイルス感染もしくは細菌感染、または登録の２４時間以内の温度＞３８℃；
９．血液サンプリング前１４日間以内の、自己申告した任意の治療試験への参加；
１０．２０ｍｇ／日を超えるのプレドニゾンまたは関連薬物の摂取（自己申告）が含まれていた。 The exclusion criteria for this study are as follows:
1. Uncontrolled hypertension (i.e., two readings 10 minutes apart >160/95);
2. Self-reported treatment for malignancies other than squamous cell or basal cell carcinoma of the skin within the past 2 years;
3. Self-reported pregnancy;
4. Self-reported chronic infectious condition(s) (e.g., hepatitis B, hepatitis C, HIV), autoimmune condition(s), or other inflammatory condition(s) (such as SLE, scleroderma, MS, Crohn's disease, or ulcerative colitis);
5. Self-reported chronic kidney or liver disease;
6. Self-reported chronic heart failure or diagnosed myocardial infarction in the past 3 months;
7. Self-reported uncontrolled diabetes (HbA1c >8, if known);
8. Self-reported acute viral or bacterial infection or temperature >38°C within 24 hours of enrollment;
9. Self-reported participation in any therapeutic trial within 14 days prior to blood sampling;
Self-reported intake of prednisone or related medications greater than 10.20 mg/day was included.

選択したモデルは、採血時での自己申告のタバコ使用状態に基づいて開発されている。Ｃｏｖａｎｃｅのデータセットは、米国のタバコ使用率を表すものであり、全体では、「現在の」タバコ使用者の普及率は１４．６％（モデル開発データセットと妥当性確認データセットとの間で１３～１５％）である。公衆衛生局長官からの２０２０年の報告により、米国における「現在の」喫煙者の普及率は、２０１８年に今までの最低の集団の１３．８％へ低下することが見出された（査定は１９６５年に開始）。Ｃｏｖａｎｃｅのデータセットにおける「以前の」喫煙者の全体的な普及率は３３．３％（モデル開発データセットと妥当性確認データセットとの間で３３～３５％）であり、２０１８年の普及率の２０．９％よりも高く、これは、サンプリング場所または人口層（人種／民族等）におけるバイアスに起因し得る（ＵｎｉｔｅｄＳｔａｔｅｓＰｕｂｌｉｃＨｅａｌｔｈＳｅｒｖｉｃｅＯｆｆｉｃｅｏｆｔｈｅＳｕｒｇｅｏｎＧｅｎｅｒａｌ；ＮａｔｉｏｎａｌＣｅｎｔｅｒｆｏｒＣｈｒｏｎｉｃＤｉｓｅａｓｅＰｒｅｖｅｎｔｉｏｎａｎｄＨｅａｌｔｈＰｒｏｍｏｔｉｏｎ（ＵＳ）ＯｆｆｉｃｅｏｎＳｍｏｋｉｎｇａｎｄＨｅａｌｔｈ．ＳｍｏｋｉｎｇＣｅｓｓａｔｉｏｎ：ＡＲｅｐｏｒｔｏｆｔｈｅＳｕｒｇｅｏｎＧｅｎｅｒａｌ．Ｗａｓｈｉｎｇｔｏｎ（ＤＣ）：ＵＳＤｅｐａｒｔｍｅｎｔｏｆＨｅａｌｔｈａｎｄＨｕｍａｎＳｅｒｖｉｃｅｓ；２０２０）。The selected model was developed based on self-reported tobacco use status at the time of blood draw. The Covance dataset is representative of tobacco use in the United States, with an overall prevalence of "current" tobacco users of 14.6% (13-15% between the model development and validation datasets). A 2020 report from the Surgeon General found that the prevalence of "current" smokers in the United States fell to its lowest population to date, 13.8%, in 2018 (assessments began in 1965). The overall prevalence of "former" smokers in the Covance dataset was 33.3% (33-35% between the model development and validation datasets), higher than the 2018 prevalence of 20.9%, which may be due to bias in sampling location or demographics (e.g., race/ethnicity) (United States Public Health Service Office of the Surgeon General; National Center for Chronic Disease Prevention and Health Promotion (US) Office on Smoking and Health. Smoking Cessation: A Report of the Surgeon General). General. Washington (DC): US Department of Health and Human Services; 2020).

データセットの層別化。この検査については、データを、３つのセット（訓練７０％／検証１５％／妥当性確認１５％）へと独立して分割し、これまでのタバコ使用者ｖｓ１度もタバコを使用したことがない者に層別化し、過剰適合問題を軽減しながら、ロバストなモデルの同定を可能にした。妥当性確認データセットは、ＰＯＣまたは精密化のステージで使用しなかった。Dataset stratification. For this study, the data were independently split into three sets (70% training/15% validation/15%) and stratified into ever tobacco users vs. never tobacco users to allow for the identification of robust models while mitigating overfitting issues. The validation dataset was not used in the POC or refinement stages.

モデル開発データ
Model development data

モデル検証データ
Model Validation Data

モデル妥当性確認データ
Model Validation Data

実施例４：開発からの結果
データＱＣ及び前分析の結果。元のデータセットは１，０２０のサンプルを含んでいたが、データＱＣ分析結果に基づいて、いくつかのサンプルを削除した。削除を表１０中で詳述する。
Example 4: Results from Development Data QC and Pre-Analysis Results. The original dataset contained 1,020 samples, but some samples were deleted based on the data QC analysis results. The deletions are detailed in Table 10.

追加で、３６３の分析物が標的確認特異度検査に合格しなかったため、それらを分析開始前に削除し、７，２３３の分析物が分析のために利用可能となった。データＱＣまたは前分析の間に、他の問題は同定されなかった。An additional 363 analytes failed target confirmation specificity testing and were removed prior to analysis, leaving 7,233 analytes available for analysis. No other issues were identified during data QC or pre-analysis.

ＰＯＣアプローチ及び結果。モデル性能要件は、ＡＵＣ≧０．７１７により満たされた。Proof-of-concept approach and results. Model performance requirements were met by an AUC ≥ 0.717.

精密化アプローチ及び結果。タバコ使用状態検査のための最終的なモデルは、３９の分析物の、タンパク質のみのエラスティックネットロジスティック回帰モデルである。モデルを、７０％のＣｏｖａｎｃｅデータセットで訓練した。検証メトリックを分離した１５％のデータセットで計算し、追加の１５％のデータセットを妥当性確認において使用するためにホールドアウトした。主要なモデル出力は、採血時でこれまでにタバコ使用者であった予測確率である。Refinement approach and results. The final model for the tobacco use status test is a 39-analyte, protein-only elastic net logistic regression model. The model was trained on 70% of the Covance dataset. Validation metrics were calculated on a separate 15% dataset, and an additional 15% dataset was held out for use in validation. The primary model output is the predicted probability of ever being a tobacco user at the time of blood draw.

フィーチャ選択は、ＰＯＣの間の単変量解析のために有意なフィーチャに基づいた。特に、「これまでにあった」ｖｓ「１度もない」、「以前の」ｖｓ「現在の」、「１度もない」ｖｓ「現在の」、及び「１度もない」ｖｓ「以前の」の比較からの上位の単変量フィーチャは、「これまでにあった」ｖｓ「１度もない」モデルにおける組み入れについて考慮された。Feature selection was based on significant features for univariate analysis during POC. In particular, the top univariate features from the "ever" vs. "never," "previous" vs. "current," "never" vs. "current," and "never" vs. "previous" comparisons were considered for inclusion in the "ever" vs. "never" model.

候補モデルは、１０分割及び５反復による交差検証を使用して適合させた。クラス不均衡を軽減するために、アップサンプリングを利用した。「これまでにあった」ｖｓ「１度もない」の評価項目についての交差検証ＡＵＣは、モデル性能を評価するために使用される主要なメトリクスであった。交差検証ＡＵＣが、完全な訓練セットで計算したＡＵＣの９０％未満のモデルは、過剰適合しているために、検討から除外した。Candidate models were fitted using cross-validation with 10 folds and 5 replicates. Upsampling was used to reduce class imbalance. Cross-validation AUC for the "ever" vs. "never" endpoint was the primary metric used to evaluate model performance. Models with cross-validation AUC less than 90% of the AUC calculated on the full training set were overfitted and were removed from consideration.

２７の異なるフィーチャセットで構築したロジスティック回帰モデルを比較した。これらのうちで、１４のモデルがそのＡＵＣ閾値を満たし、過剰適合の徴候を示さなかった。上位３つのモデルを、検証データセットでのＡＵＣに基づいて選んだ。Logistic regression models built with 27 different feature sets were compared. Of these, 14 models met their AUC threshold and showed no signs of overfitting. The top three models were selected based on AUC on the validation dataset.

選択したモデルは、様々なモデル堅牢化尺度を介して査定されるロバスト性に基づいて選ばれた。特に、最終的なモデルは、最小の許容区間を有し、ＱＣ反復検査に合格した。The selected models were chosen based on their robustness, as assessed through various model robustness measures. In particular, the final models had the smallest tolerance intervals and passed QC replicate tests.

最終的なモデルはエラスティックネット線形回帰モデルであり、３９の非ゼロフィーチャ及び４のゼロ係数フィーチャを有し、α＝０．１００及びλ＝０．０２５４である。開発において、判定カットオフ（「これまでにあった」ｖｓ「１度もない」のラベル付けのため）を、小数点２桁までの最適決定カットオフの平均である０．４６として設定した。ここで、「最適」は、訓練及び検証の両方のデータセットで計算される「これまでにあった」ｖｓ「１度もない」の検査についてＹｏｕｄｅｎのＪ^２１を最大化するカットオフであるように選ばれた。最終的なモデルは、０．４６０の判定カットオフを使用して、小数点３桁での予測確率を報告するだろう。 The final model was an elastic net linear regression model with 39 non-zero features and 4 zero-coefficient features, α=0.100, and λ=0.0254. In development, the decision cutoff (for labeling "ever" vs. "never") was set as 0.46, the average of the optimal decision cutoffs to two decimal points, where "optimal" was chosen to be the cutoff that maximized Youden's^J for testing "ever" vs. "never" computed on both the training and validation datasets. The final model will report predicted probabilities to three decimal points using a decision cutoff of 0.460.

訓練データ及び検証データについての性能メトリックを、表１１中で示す。訓練及び検証のＡＵＣ値は、０．７１７以上のＡＵＣの要件を満たす。
The performance metrics for the training and validation data are shown in Table 11. The AUC values for training and validation meet the requirement of an AUC of 0.717 or greater.

「これまでの」タバコ使用者クラス内の「現在の」タバコ使用者及び「以前の」タバコ使用者を層別化する性能。予測確率を、「現在の」、「以前の」、及び「１度もない」の３つのタバコ使用クラスにわたって多層化した。Performance in stratifying "current" and "former" tobacco users within the "ever" tobacco user class. Predicted probabilities were stratified across the three tobacco use classes: "current," "former," and "never."

実施例５：モデル妥当性確認の計画及び結果
臨床的妥当性確認の計画。妥当性確認を、まだ使用されていないＣｏｖａｎｃｅのデータのうちの１５％のホールドアウトデータセットで査定するだろう。精密化からの最終的なモデルは、３９フィーチャのエラスティックネットロジスティック回帰モデルである。「これまでの」タバコ使用者であることについての予測確率は、この最終的なモデルを使用して計算され、これらの予測値は、真の「これまでにあった」ｖｓ「１度もない」の真のスタンダードへ比較されるだろう。モデルは、妥当性確認データセットで、少なくとも０．６５、０．６６、０．６７、０．６８、０．６９、０．７０、０．７１７、またはそれらを超えるＡＵＣを有する必要があるだろう。 Example 5: Model Validation Plan and Results Clinical Validation Plan. Validation will be assessed on a holdout dataset of 15% of the unused Covance data. The final model from refinement will be a 39-feature elastic net logistic regression model. Predicted probabilities for being an "ever" tobacco user will be calculated using this final model, and these predictions will be compared to the true standards of "ever" vs. "never." Models will be required to have an AUC of at least 0.65, 0.66, 0.67, 0.68, 0.69, 0.70, 0.717, or greater on the validation dataset.

妥当性確認データについての臨床結果。妥当性確認に要求される合格基準は、タバコ使用状態モデルが、妥当性確認データセットで、少なくとも０．６５、０．６６、０．６７、０．６８、０．６９、０．７０、０．７１７、またはそれらを超えるＡＵＣを有するというものであった。モデルは、表１２中で示すように、０．７２９に等しいＡＵＣにより妥当性確認に合格した。
Clinical Results for Validation Data. The required passing criterion for validation was that the tobacco use status model have an AUC of at least 0.65, 0.66, 0.67, 0.68, 0.69, 0.70, 0.717, or greater on the validation data set. The model passed validation with an AUC equal to 0.729, as shown in Table 12.

モデル性能メトリクスは、訓練データセットまたは検証のデータセットにおいてよりも、妥当性確認データでわずかに低下する。しかしながら、ＡＵＣは、性能基準に合格するほどに十分に高い。Model performance metrics are slightly lower on the validation data than on the training or validation data sets. However, the AUC is still high enough to pass the performance criteria.

以下の表１３中のシミュレーションからの結果は、範囲外のアプタマーを取り扱うために最も良好なアプローチが分析物の削除であることを示す。The results from the simulations in Table 13 below show that analyte deletion is the best approach for dealing with outlying aptamers.

許容誤差限界シミュレーションからの結果は、アッセイノイズからもたらされるモデルにおける分散を説明する。線形空間における９５％及び９９％の上限及び下限、ならびに確率空間における最大限界を、表１４中で記載する。
The results from the margin of error simulation account for the variance in the model resulting from assay noise. The 95% and 99% upper and lower bounds in linear space and the maximum bound in probability space are listed in Table 14.

いくつかの実施形態において、１つまたは複数のバイオマーカーは、単変量性能の分析及び／または決定において含まれ得る。非限定例によって、バイオマーカーとしては、ｄＣＴＰピロホスファターゼ１（標的名称ＸＴＰ３Ａ、Ｅｎｔｒｅｚ遺伝子記号ＤＣＴＰＰ１、ＵｎｉｐｒｏｔＩＤＱ９Ｈ７７３）；マトリックスリモデリング関連タンパク質８（標的名称ＭＸＲＡ８、Ｅｎｔｒｅｚ遺伝子記号ＭＸＲＡ８、ＵｎｉｐｒｏｔＩＤＱ９ＢＲＫ３）；Ｔ複合体タンパク質１１様タンパク質１（標的名称Ｔ１１Ｌ１、Ｅｎｔｒｅｚ遺伝子記号ＴＣＰ１１Ｌ１、ＵｎｉｐｒｏｔＩＤＱ９ＮＵＪ３）、及び／または軟骨酸性タンパク質１（標的名称ＣＲＡＣ１、Ｅｎｔｒｅｚ遺伝子記号ＣＲＴＡＣ１、ＵｎｉｐｒｏｔＩＤＱ９ＮＱ７９）が挙げられ得る。単変量性能を表２５中で示す。In some embodiments, one or more biomarkers may be included in the analysis and/or determination of univariate performance. By non-limiting example, biomarkers may include dCTP pyrophosphatase 1 (target name XTP3A, Entrez gene symbol DCTPP1, Uniprot ID Q9H773); matrix remodeling associated protein 8 (target name MXRA8, Entrez gene symbol MXRA8, Uniprot ID Q9BRK3); T-complex protein 11-like protein 1 (target name T11L1, Entrez gene symbol TCP11L1, Uniprot ID Q9NUJ3), and/or cartilage acidic protein 1 (target name CRAC1, Entrez gene symbol CRTAC1, Uniprot ID Q9NQ79). Univariate performance is shown in Table 25.

実施例６：タバコ使用モデルバイオマーカーパネルの分析
表４中でリストしたバイオマーカーの様々な組み合わせを含むモデルバイオマーカーパネルを分析して、様々な組み合わせについての曲線下面積（ＡＵＣ）値を決定した。モデルバイオマーカーパネルは、少なくとも０．６５、少なくとも０．６６、少なくとも０．６７、少なくとも０．６８、少なくとも０．６９、少なくとも０．７、少なくとも０．７５、少なくとも０．８、少なくとも０．８５、少なくとも０．９、または少なくとも０．９５のＡＵＣ値を有するＮ個のバイオマーカータンパク質のパネルに基づき得、Ｎは、表４中でリストしたバイオマーカータンパク質のうちの１、２、３、４、５、６、７、８、９、１０、１１、１２、１３、１４、１５、１６、１７、１８、１９、２０、２１、２２、２３、２４、２６、２７、２８、２９、３０、３１、３２、３３、３４、３５、３６、３７、３８、及び／または３９個である。以下の表は、１～３９のバイオマーカータンパク質を含む様々な組み合わせが測定された場合の例示的なモデルの結果を示す。
Example 6: Analysis of a Tobacco Use Model Biomarker Panel A model biomarker panel comprising various combinations of the biomarkers listed in Table 4 was analyzed to determine area under the curve (AUC) values for the various combinations. The model biomarker panel can be based on a panel of N biomarker proteins having an AUC value of at least 0.65, at least 0.66, at least 0.67, at least 0.68, at least 0.69, at least 0.7, at least 0.75, at least 0.8, at least 0.85, at least 0.9, or at least 0.95, where N is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, and/or 39 of the biomarker proteins listed in Table 4. The following table shows the results of an exemplary model when various combinations comprising 1-39 biomarker proteins are measured.