ODIN, the Online Database of INterlinear Text (I know, I know, why not “ODIT”?) collects sentences from linguistics journals that have interlinear glosses.
They’ve got data on over a thousand languages–and probably most useful, they give you something grammars usually don’t–what ISN’T allowed (or at least what is claimed to be disallowed).
For example, from Broadwell (2002) on Amharic:
28) *wädä [[yä=Yohannïs]PP bet]]NP toward of=John house (`toward John's house')
29) wädä [[Yohannïs]PP bet]]NP toward John house `toward John's house'