Python character extraction from code base

Given the code base (for example, a large C project or Objective-C), I would like to analyze the source code files and select the characters of interest. These can be class declarations, variable names, or types or method names. Is there a Python module that could help me with this?

The only approach I see in the future is to use regular expressions to collect these characters, but I think it can get very ugly very quickly. I am also not an expert in compilers or parsers, so lighter weight would be preferable.

Thanks for any suggestions.

------ update -----

Thanks for all the suggestions so far, definitely some promising leaders. Another possibility that might be possible: what if I could compile a project that I was trying to analyze. Can debugging symbols (dsym) simplify this process? I'm not looking for something advanced, just a list of classes with their ivar names and methods. At the moment, considering the analysis tools offered, it seems, more than I can afford to invest in this project right now.

+5
source share
3 answers

. , .

+1

langauge ( C , Objective C), langauge langauge, , . , , //. "".

, DMS Software Reengineeering Toolkit. DMS C Front End C . "", , AST, (, , typedef,...).

Python, .

+1

All Articles