General Information of the Compound
Compound ID
CP0417167
Compound Name
(4S)-5-[[2-[[(2S,3R)-1-[[(2S)-1-[[(2S,3R)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[2-[[(2S)-5-amino-1-[[(2S)-1-[[(2S)-1-[[(2S)-6-amino-1-[[(2S)-1-[[(2S)-1-[[(2S,3S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-6-amino-1-[[2-[[(2S)-1-[(2-amino-2-oxoethyl)amino]-5-carbamimidamido-1-oxopentan-2-yl]amino]-2-oxoethyl]amino]-1-oxohexan-2-yl]amino]-3-methyl-1-oxobutan-2-yl]amino]-4-methyl-1-oxopentan-2-yl]amino]-3-(1H-indol-3-yl)-1-oxopropan-2-yl]amino]-1-oxopropan-2-yl]amino]-3-methyl-1-oxopentan-2-yl]amino]-1-oxo-3-phenylpropan-2-yl]amino]-4-carboxy-1-oxobutan-2-yl]amino]-1-oxohexan-2-yl]amino]-1-oxopropan-2-yl]amino]-1-oxopropan-2-yl]amino]-1,5-dioxopentan-2-yl]amino]-2-oxoethyl]amino]-4-carboxy-1-oxobutan-2-yl]amino]-4-methyl-1-oxopentan-2-yl]amino]-3-(4-hydroxyphenyl)-1-oxopropan-2-yl]amino]-3-hydroxy-1-oxopropan-2-yl]amino]-3-hydroxy-1-oxopropan-2-yl]amino]-3-methyl-1-oxobutan-2-yl]amino]-3-carboxy-1-oxopropan-2-yl]amino]-3-hydroxy-1-oxopropan-2-yl]amino]-3-hydroxy-1-oxobutan-2-yl]amino]-1-oxo-3-phenylpropan-2-yl]amino]-3-hydroxy-1-oxobutan-2-yl]amino]-2-oxoethyl]amino]-4-[[(2S)-2-[[(2S)-2-amino-3-(1H-imidazol-4-yl)propanoyl]amino]propanoyl]amino]-5-oxopentanoic acid
Synonyms
HAEGTFTSDVSSYLEGQAAKEFIAWLVKGRG-NH2
Structure
Formula
C151H229N41O46
Molecular Weight
3354.734
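As a rough consistency check, the molecular weight can be recomputed from the formula. The sketch below assumes conventional rounded average atomic masses (C 12.011, H 1.008, N 14.007, O 15.999 g/mol) and uses a hypothetical helper named `molecular_weight`; neither comes from the database entry itself.

```python
import re

# Assumed average atomic masses in g/mol (conventional rounded values,
# not taken from the database entry)
ATOMIC_MASS = {"C": 12.011, "H": 1.008, "N": 14.007, "O": 15.999}

def molecular_weight(formula: str) -> float:
    """Sum atomic masses for a simple formula such as 'C151H229N41O46'."""
    total = 0.0
    # Each match is (element symbol, optional count); a missing count means 1
    for symbol, count in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        total += ATOMIC_MASS[symbol] * (int(count) if count else 1)
    return total

mw = molecular_weight("C151H229N41O46")
print(f"{mw:.3f}")
```

Under those assumed masses, the result agrees with the listed value of 3354.734.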
Canonical SMILES
CC[C@H](C)[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)Cc1cnc[nH]1)[C@@H](C)O)[C@@H](C)O)C(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(N)=O
InChI
InChI=1S/C151H229N41O46/c1-17-77(10)121(148(236)170-81(14)127(215)178-105(60-87-63-161-92-36-25-24-35-90(87)92)138(226)180-101(56-74(4)5)139(227)189-119(75(6)7)146(234)177-94(37-26-28-52-152)130(218)163-66-112(201)171-93(39-30-54-160-151(157)158)129(217)162-65-111(156)200)191-140(228)103(57-84-31-20-18-21-32-84)181-135(223)99(47-51-117(208)209)176-134(222)95(38-27-29-53-153)173-125(213)79(12)167-124(212)78(11)169-133(221)98(44-48-110(155)199)172-113(202)67-164-132(220)97(46-50-116(206)207)175-136(224)100(55-73(2)3)179-137(225)102(59-86-40-42-89(198)43-41-86)182-143(231)107(69-193)185-145(233)109(71-195)186-147(235)120(76(8)9)190-142(230)106(62-118(210)211)183-144(232)108(70-194)187-150(238)123(83(16)197)192-141(229)104(58-85-33-22-19-23-34-85)184-149(237)122(82(15)196)188-114(203)68-165-131(219)96(45-49-115(204)205)174-126(214)80(13)168-128(216)91(154)61-88-64-159-72-166-88/h18-25,31-36,40-43,63-64,72-83,91,93-109,119-123,161,193-198H,17,26-30,37-39,44-62,65-71,152-154H2,1-16H3,(H2,155,199)(H2,156,200)(H,159,166)(H,162,217)(H,163,218)(H,164,220)(H,165,219)(H,167,212)(H,168,216)(H,169,221)(H,170,236)(H,171,201)(H,172,202)(H,173,213)(H,174,214)(H,175,224)(H,176,222)(H,177,234)(H,178,215)(H,179,225)(H,180,226)(H,181,223)(H,182,231)(H,183,232)(H,184,237)(H,185,233)(H,186,235)(H,187,238)(H,188,203)(H,189,227)(H,190,230)(H,191,228)(H,192,229)(H,204,205)(H,206,207)(H,208,209)(H,210,211)(H4,157,158,160)/t77-,78-,79-,80-,81-,82+,83+,91-,93-,94-,95-,96-,97-,98-,99-,100-,101-,102-,103-,104-,105-,106-,107-,108-,109-,119-,120-,121-,122-,123-/m0/s1
InChIKey
FZWMHWVGNYURLN-AAEALURTSA-N
Physicochemical Property
logP
-14.88363
Rotatable Bonds
110
Heavy Atom Count
238
Polar Surface Area
1414.19
Hydrogen Bond Donor Count
50
Hydrogen Bond Acceptor Count
47
Complexity
238

"RO5" indicates the cutoffs set by Lipinski's rule of five:

(1) Molecular weight less than 500 Da;

(2) XlogP less than 5;

(3) No more than 5 hydrogen bond donors (Hydrogen Bond Donor Count);

(4) No more than 10 hydrogen bond acceptors (Hydrogen Bond Acceptor Count);

(5) No more than 10 rotatable bonds (Rotatable Bonds).
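The five cutoffs can be applied to the values listed under Physicochemical Property. The following is a minimal sketch, not the database's own implementation: the dictionary keys and the `ro5_violations` helper are hypothetical names, and it assumes the listed logP can stand in for xlogp.

```python
# Property values copied from the Physicochemical Property table above
compound = {
    "mol_weight": 3354.734,
    "logp": -14.88363,   # assumption: the listed logP stands in for xlogp
    "hbd": 50,           # Hydrogen Bond Donor Count
    "hba": 47,           # Hydrogen Bond Acceptor Count
    "rot_bonds": 110,    # Rotatable Bonds
}

# Cutoffs as stated in criteria (1)-(5); each check is True when satisfied
RO5_CHECKS = {
    "mol_weight": lambda v: v < 500,
    "logp": lambda v: v < 5,
    "hbd": lambda v: v <= 5,
    "hba": lambda v: v <= 10,
    "rot_bonds": lambda v: v <= 10,
}

def ro5_violations(props: dict) -> list:
    """Return the names of the criteria that the given properties fail."""
    return [name for name, ok in RO5_CHECKS.items() if not ok(props[name])]

violations = ro5_violations(compound)
print(violations)
```

With the listed values, only the logP criterion is met; molecular weight, both hydrogen bond counts, and the rotatable bond count exceed their cutoffs, as expected for a 31-residue peptide.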

External Link(s) of This Compound
PubChem ID
CID: 44577329
SID: 15188658
ChEMBL ID
CHEMBL499930
Map of Molecular Bioactivity Related to the Compound
[Figure: network map linking the compound to cell lines and proteins. Node types: Compound, Cell Line, Protein. Bioactivity value legend: <= 0.1 μM; > 0.1 μM and <= 10 μM; > 10 μM; Imprecise Activity.]
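The bioactivity value bins in the legend above are given in μM, while the activities in this entry are reported in nM. The sketch below shows one way to map a value to its bin; the `activity_bin` helper is hypothetical, and treating `None` as an imprecise value is an assumption made only for illustration.

```python
def activity_bin(value_nm):
    """Map an activity value in nM to the legend's bin label."""
    if value_nm is None:           # assumption: None marks an imprecise value
        return "Imprecise Activity"
    value_um = value_nm / 1000.0   # 1 μM = 1000 nM
    if value_um <= 0.1:
        return "<= 0.1 μM"
    if value_um <= 10:
        return "> 0.1 μM and <= 10 μM"
    return "> 10 μM"

# Values reported in nM in the bioactivity table of this entry
reported = {
    "EC50 (CHO)": 0.06,
    "Ki (CHO)": 0.32,
    "EC50 (CHO-K1)": 3.7,
    "IC50 (CHO-K1)": 1.7,
    "EC50 (INS-1)": 0.9,
}
bins = {name: activity_bin(value) for name, value in reported.items()}
print(bins)
```

All five reported values are well below 100 nM, so each falls in the "<= 0.1 μM" bin.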
Table of Molecular Bioactivities Related to the Compound
Protein ID: PT01356, Glucagon-like peptide 1 receptor
Cell-based Assay
Cell Line ID   Cell Line Name   Cell Line Organism                     No.   Bioactivity
CL000011       CHO              Cricetulus griseus (Chinese hamster)   1     EC50 = 0.06 nM
CL000011       CHO              Cricetulus griseus (Chinese hamster)   2     Ki = 0.32 nM
CL000026       CHO-K1           Cricetulus griseus (Chinese hamster)   1     EC50 = 3.7 nM
CL000026       CHO-K1           Cricetulus griseus (Chinese hamster)   2     IC50 = 1.7 nM
Protein ID: PT06275, Glucagon-like peptide 1 receptor
Cell-based Assay
Cell Line ID   Cell Line Name   Cell Line Organism        No.   Bioactivity
CL000127       INS-1            Rattus norvegicus (Rat)   1     EC50 = 0.9 nM
Clinical Information about the Compound
Drug 1 (HAEGTFTSDVSSYLEGQAAKEFIAWLVKGRG-NH2)
Drug Name HAEGTFTSDVSSYLEGQAAKEFIAWLVKGRG-NH2
Target(s)
Glucagon-like peptide 1 receptor (GLP1R)
Inhibitor