Skip to main content

Table 5 Top 10 most common LINGOs of each compound data set

From: A comparative study of SMILES-based compound similarity functions for drug-target interaction prediction

Enzyme/445

GPCR/223

LINGO

Num. of drugs

LINGO

Num. of drugs

c0cc

321

c0cc

180

(=O)

300

0ccc

170

0ccc

279

(=O)

117

C(=O

228

cccc

108

ccc0

197

ccc0

107

cccc

171

ccc(

94

)c0c

155

C(=O

87

@H](

149

)c0c

84

ccc(

144

Cc0c

78

[C@H

144

C(O)

72

Ion Channels/210

Nuclear Receptors/54

LINGO

Num. of drugs

LINGO

Num. of drugs

c0cc

165

(=O)

37

0ccc

148

[C@H

35

(=O)

130

C@H]

35

ccc0

116

C(=O

35

cccc

105

H]0C

35

C(=O

101

[C@@

35

)c0c

94

C@@H

35

ccc(

72

@@H]

35

O)c0

56

@H]0

35

=O)c

54

)[C@

34