General Information of the Compound
Compound ID
CP0315041
Compound Name
18-[[(1R)-4-[2-[2-[2-[2-[2-[2-[[(5S)-5-[[(2S)-2-[[(2S)-2-[[(2S)-5-amino-2-[[2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S,3R)-2-[[(2S)-2-[[(2S,3R)-2-[[2-[[(2S)-2-[[2-[[(2S)-2-amino-3-(1H-imidazol-5-yl)propanoyl]amino]-2-methylpropanoyl]amino]-4-carboxybutanoyl]amino]acetyl]amino]-3-hydroxybutanoyl]amino]-3-phenylpropanoyl]amino]-3-hydroxybutanoyl]amino]-3-hydroxypropanoyl]amino]-3-carboxypropanoyl]amino]-3-methylbutanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-4-methylpentanoyl]amino]-4-carboxybutanoyl]amino]acetyl]amino]-5-oxopentanoyl]amino]propanoyl]amino]propanoyl]amino]-6-[[(2S)-1-[[(2S)-1-[[(2S,3S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-5-carbamimidamido-1-[[2-[[(2S)-5-carbamimidamido-1-(carboxymethylamino)-1-oxopentan-2-yl]amino]-2-oxoethyl]amino]-1-oxopentan-2-yl]amino]-3-methyl-1-oxobutan-2-yl]amino]-4-methyl-1-oxopentan-2-yl]amino]-3-(1H-indol-3-yl)-1-oxopropan-2-yl]amino]-1-oxopropan-2-yl]amino]-3-methyl-1-oxopentan-2-yl]amino]-1-oxo-3-phenylpropan-2-yl]amino]-4-carboxy-1-oxobutan-2-yl]amino]-6-oxohexyl]amino]-2-oxoethoxy]ethoxy]ethylamino]-2-oxoethoxy]ethoxy]ethylamino]-1-carboxy-4-oxobutyl]amino]-18-oxooctadecanoic acid
    Show/Hide
Synonyms
NN9535
Semaglutide
    Show/Hide
Structure
Formula
C187H291N45O59
Molecular Weight
4113.641
Canonical SMILES
CC[C@H](C)[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCCNC(=O)COCCOCCNC(=O)COCCOCCNC(=O)CC[C@@H](NC(=O)CCCCCCCCCCCCCCCCC(O)=O)C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)C(C)(C)NC(=O)[C@@H](N)Cc1c[nH]cn1)[C@@H](C)O)[C@@H](C)O)C(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O
    Show/Hide
InChI
InChI=1S/C187H291N45O59/c1-18-105(10)154(180(282)208-108(13)159(261)216-133(86-114-89-200-119-50-40-39-49-117(114)119)170(272)218-129(82-102(4)5)171(273)228-152(103(6)7)178(280)215-121(53-44-72-199-186(192)193)162(264)201-91-141(242)209-120(52-43-71-198-185(190)191)161(263)204-94-151(257)258)230-172(274)131(83-111-45-33-31-34-46-111)219-167(269)126(64-69-149(253)254)214-166(268)122(51-41-42-70-195-144(245)98-290-79-78-289-76-74-197-145(246)99-291-80-77-288-75-73-196-139(240)66-61-127(183(285)286)211-140(241)54-37-29-27-25-23-21-19-20-22-24-26-28-30-38-55-146(247)248)212-158(260)107(12)206-157(259)106(11)207-165(267)125(60-65-138(189)239)210-142(243)92-202-163(265)123(62-67-147(249)250)213-168(270)128(81-101(2)3)217-169(271)130(85-113-56-58-116(238)59-57-113)220-175(277)135(95-233)223-177(279)137(97-235)224-179(281)153(104(8)9)229-174(276)134(88-150(255)256)221-176(278)136(96-234)225-182(284)156(110(15)237)231-173(275)132(84-112-47-35-32-36-48-112)222-181(283)155(109(14)236)227-143(244)93-203-164(266)124(63-68-148(251)252)226-184(287)187(16,17)232-160(262)118(188)87-115-90-194-100-205-115/h31-36,39-40,45-50,56-59,89-90,100-110,118,120-137,152-156,200,233-238H,18-30,37-38,41-44,51-55,60-88,91-99,188H2,1-17H3,(H2,189,239)(H,194,205)(H,195,245)(H,196,240)(H,197,246)(H,201,264)(H,202,265)(H,203,266)(H,204,263)(H,206,259)(H,207,267)(H,208,282)(H,209,242)(H,210,243)(H,211,241)(H,212,260)(H,213,270)(H,214,268)(H,215,280)(H,216,261)(H,217,271)(H,218,272)(H,219,269)(H,220,277)(H,221,278)(H,222,283)(H,223,279)(H,224,281)(H,225,284)(H,226,287)(H,227,244)(H,228,273)(H,229,276)(H,230,274)(H,231,275)(H,232,262)(H,247,248)(H,249,250)(H,251,252)(H,253,254)(H,255,256)(H,257,258)(H,285,286)(H4,190,191,198)(H4,192,193,199)/t105-,106-,107-,108-,109+,110+,118-,120-,121-,122-,123-,124-,125-,126-,127+,128-,129-,130-,131-,132-,133-,134-,135-,136-,137-,152-,153-,154-,155-,156-/m0/s1
    Show/Hide
InChIKey
DLSWIYLPEUIQAV-CCUURXOWSA-N
Physicochemical Property
logP
-11.62786
Rotatable Bonds
149
Heavy Atom Count
291
Polar Areas
1646.18
Hydrogen Bond Donor Count
57
Hydrogen Bond Acceptor Count
56
Complexity
291

"RO5" indicates the cutoff set by lipinski's rule of five:

(1) Molecular weight less than 500 Dalton;

(2) xlogp less than 5;

(3) No more than 5 hbonddonor (Hydrogen Bond Donor Count);

(4) No more than 10 hbondacc (Hydrogen Bond Acceptor Count);

(5) No more than 10 rotbonds (Rotatable Bond Count).

    Show/Hide
Click to Show/Hide the External Link(s) of This Compound
PubChem ID
CID: 56843331
ChEMBL ID
CHEMBL3616752
DrugBank ID
DB13928
Map of Molecular Bioactivity Related to the Compound
Map of Molecular Bioactivity Related to the Compound

Compound
Cell Line
Protein

Bioactivity Value:

<= 0.1 μM
> 0.1 μM and <= 10 μM
> 10 μM
Imprecise Activity
Table of Molecular Bioactivities Related to the Compound
Protein ID: PT01356, Glucagon-like peptide 1 receptor
Cell-based Assay
Cell Line ID Cell Line Name Cell Line Organism
CL000026 CHO-K1 Cricetulus griseus (Chinese hamster)  2
1
EC50 = 0.0187 nM
   TI
   LI
   LO
   TS
2
EC50 = 0.0192 nM
   TI
   LI
   LO
   TS
CL000051 BHK-21 Mesocricetus auratus (Golden hamster)  2
1
IC50 = 0.13 nM
   TI
   LI
   LO
   TS
2
IC50 = 30 nM
   TI
   LI
   LO
   TS
Biochemical Assays
1 Kd = 441 nM
Clinical Information about the Compound
Drug 1 ( Semaglutide )
Drug Name Semaglutide
Company Novo Nordisk
Indication
Type-2 diabetes
Phase 3
Target(s)
Glucagon-like peptide 1 receptor (GLP1R)
Agonist